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CONFERENCE  OVERVIEW 

R.  D.  Skeel 
Program  Committee  Chairperson 


This  meeting  is  another  in  a  series  of  single-topic 
SIGNUM  Meetings.   It  differs  from  most  of  its  predecessors  in  two 
ways.   First,  it  is  devoted  to  a  well  established  and  well  defined 
area  of  numerical  mathematics;  and  although  the  scope  of  the 
meeting  is  quite  narrow,  it  has  a  large  vertical  extent  ranging 
from  the  very  theoretical  down  to  the  very  practical.   Second,  by 
publicizing  the  meeting  well  in  advance  and  by  obtaining  the 
generous  support  of  the  National  Science  Foundation,  it  was 
possible  to  make  the  meeting  truly  international. 

The  numerical  solution  of  the  initial  value  problem 
for  ordinary  differential  equations  is  often  cited  as  one  of  the 
notable  successes  of  numerical  mathematics.   Nevertheless,  there 
are  evidently  many  who  believe  that  much  can  still  be  done,  and 
the  significant  advances  being  reported  at  this  meeting  confirm 
that  belief. 

The  planning  of  the  program  has  been  influenced  to  some 
extent  by  the  following  categorization  of  research  efforts  in 
numerical  ODEs:   (i)  theory — the  analysis  of  numerical  methods  and 
techniques,  (ii)  algorithms — the  synthesis  of  numerical  methods 
and  techniques,  (iii)  software — implementation  of  methods  and 
techniques  as  general  purpose  library  routines,  (iv)  applications — 
design  of  algorithms  and  tailoring  of  software  for  specific  areas 
of  application.   These  various  aspects  of  numerical  ODEs  might  be 
visualized  as  a  triangle  with  algorithms  in  the  center,  theory  at 
the  top  corner,  and  software  and  applications  at  the  two  supporting 
bottom  corners.   A  rough  breakdown  of  symposium  presentations  is 
fifteen  in  theory,  fourteen  in  algorithms,  five  in  software,  and 
five  in  applications.   This  preponderance  of  more  theoretical 
papers  is  balanced  by  the  more  practical  orientation  of  the 
workshop. 

The  organization  of  the  technical  program  is  in  a  large 
part  due  to  the  efforts  of  T.  E.  Hull,  J.  D.  Lambert,  L.  F.  Shampine, 
and  H.  J.  Stetter  in  reviewing  the  summaries  for  about  thirty-four 
papers  and  I  would  like  to  thank  them  very  much.   Also  appreciated 
is  the  sponsorship  by  Tom  Hull  of  the  OlympiODE. 

I  would  like  to  acknowledge  the  efforts  of  Bill  Gear  and 
Dan  Watanabe  in  the  overall  organization  of  the  conference  as  well 
as  their  advice  on  the  planning  of  the  program.   It  should  also  be 
noted  that  Bill  organized  the  workshop.   The  assistance  of  Dennis 
Gannon  and  Mitch  Roth  with  local  arrangements  is  gratefully 
acknowledged. 

The  University  of  Illinois  Department  of  Computer  Science 
has  supported  this  conference  in  numerous  ways.   Barb  Armstrong  has 
done  a  great  deal  of  secretarial  work,  and  business  manager  Clyde 
Helm  has  been  very  helpful. 

Finally,  we  are  very  pleased  to  have  so  many  of  the  top 
researchers  from  throughout  the  world  participate  in  this 
meeting.   The  efforts  and  expenditures  of  all  participants  is  much 
appreciated. 


SOME  PROPERTIES  OF  LINEAR  MULTISTEP  AND  ONE-LEG 
METHODS  FOR  ORDINARY  DIFFERENTIAL  EQUATIONS 

Germund  Dahlquist 

Royal  Institute  of  Technology 
S-IOOI4I*  Stockholm,  Sweden 


1.  I'hat  is  a  one-leg  method  and  what  relation  does 
it  have  to  a  linear  multisteD  method? 

This  paper  deals  with  the  numerical  integration  of 
the  initial  value  problem  for  the  differential 
system, 

(1.1)    dy/dt  =  f(y),   y£fs  . 

To  simplify  the  writing  we  restrict  the  discussion 
to  autonomous  systems.  The  generalization  of  re- 
sults for  non-linear  systems  to  the  non-auto- 
nomous case  is  straight-forward,  see  e.g. 
Dahlquist  0978). 

In  the  constant  stepsize  case,  tn  =  t0+nh,  a 
one-leg  method  is  written, 

k  k 

(1-2)     I  «,r^  «hf(  I    6,yn+,)  , 


j=o  J  n+J 


j=0  < 


where  Jb.=  1.  It  is  "The  one-leg  twin"  of  the  more 
familiar  linear  multistep  method, 

{K3)    Jo°JPn+0=hJoBJf(^)- 

Some  one-leg  methods  have  been  in  use  for  a 
long  time.  One  of  the  simplest  examples  is  the 
implicit  midpoint  method, 

Vl-^^n+l+V)' 
which  is  the  one-leg  twin  of  the  trapezoidal 
method, 

The  backward  differentiation  methods  are  their  own 
one-leg  twins. 

In  terms  of  the  displacement  operator  E, 
Ey  ay  ...  and  the  generating  polynomials, 
n  k  k 

pU)  =  I    a.  ?j,   a(c)  =  I    B.?j  , 


j=o 


J=0 


(1.3*)  oU)9     =  ha(E)f(y  ) 
n  n 


the  methods  can  be  written, 

(1.2')   p(E)y  =  hf (o(E)y  )    (one-leg) 
n  n' 

i    (linear  multistep) 

e  assume  that  p,o  are  relatively  prime. 

The  normalization  convention  and  the  consist- 
ency conditions  read, 

(1.1»)    0(1)  =  1,  (normalization) 

(1.5)    p(D=0,  p'(D  =  0(1)   (consistency) 

For  constant  stepsize   the  following  relations 
between  the  twins  are  well  known,  Dahlquist 
(1975a). 


a)  LINEAR  IDENTITY.    They  yield  the  same  difference 
equation  for  a  linear  autonomous  system. 

b)  NON-LINEAR  EQUIVALENCE   (first  part).  If  {y  } 
satisfies  the  one-leg  equation  (1.2')  and  if 

yn  =  o(E)yn, 

then  {y  }  satisfies  the  linear  multistep  equation 
(l.3').n 

In  order  to  prove  the  second  statement ,  one 
multiplies  (1.2')  by  the  operator  a(E)  and  makes 
use  of  aommutativity ,  o(E)p(E)  =  p(E)a(E). 

Dahlquist  ( 1975a)  introduced  one-leg  methods, 
since  they  gave  neater  formulations  in  the  analy- 
sis of  numerical  stability  for  non-linear  systems. 
There  are,  however,  other  reasons  for  the  consider- 
ation of  these  methods.  For  example,  they  give  an 
efficient  implementation  of  the   linear  multistep 
twins.    Note  that  one  needs  k  vectors  only, 
vn>vn+1  '• • •  '^n+k-1 »  in  order  to  compute  yn+k  from 
(1.2).  Then  yn  =  a(E)yn  is  computed  but  not  stored. 
The  most  straightforward  implementation  of  (1.3) 
needs  2k  values  from  the  past  (except  for  "sparse" 
methods  like  Adams  or  BDF),  if  one  does  not  re- 
compute k  old  values  of  f(y).  However,  Skeel 
(1978)  has  recently  found  other  implementations  of 
a  general  linear  multistep  method  with  the  same 
advantage  as  the  one-leg  implementation. 

A  one-leg  method  may  also  be  considered  as  a 
numerical  method  in  its  own  right.    Nevanlinna  and 
Liniger  (1979)  point  out  that  it  is  advantegeous 
compared  to  a  linear  multistep  method  in  variable 
step   computations.  On  the  other  hand,  the  latter 
has,  in  a  sense,  usually  higher  accuracy   than 
its  one-leg  twin,  see  Section  2. 

In  the  variable  stepsize  case,  let  hm=tm-tm-i. 
We  temporarily  write  a  one-leg  method  in  the  form, 


where  the  ajn,Bin,  depend  on  the  stepsizes 

•  'nn+k'  We  fi-11^'  by  dimensional  ana- 


'hn+2- 


"n+1 

lysis,  that  it  is  sufficient  to  consider  methods 

of  the  form, 
(1.6) 


Pn(E)^n 


=  h  f (0  (E)y  )  , 

in   ^  n      r\J 


n 


a 


h  >  0 
n 


where  the  coefficients  of  pn 
-!>  J  =  2,3,. 


depend  on  the 
homogeneous  of  the  first  degree  in 


3  hn+j/hn+j 


,k,  only,  and  hn  is 


1-1 


h  „    h  .,  .  The  consistency  conditions   read 
+  '  '  -  +  '  •    n+K 

pn(l)  =0,   Pn(E)tn=hn. 
The  method  (1.6)  is  normalized   if, 
.8)   P;(D  =  an(D  =  1  • 

The  condition  p  '  ( 1  )  =  a  ( 1 )  is  introduced  for  for- 
mal conveniencenonly.  It  means  no  restriction, 
since  it  will  just  give  a  natural  scaling  of  &n> 

Note  that  in  general,   pn(E)  and  ojE)   don't 
rte.  Therefore,  the  short  proof  of  non-linear 
equivalence  given  above,  does  not  work  for  arbi- 
trary changes  of  stepsize.  In  fact,  it _ follows 
from  an  argument  of  Nevanlinna  and  Liniger  (1 979, 
p.6U)  that  there  is  no  non-linear  equivalence  for 
arbitrary   stepsize  sequences.   However,  for 
geometric   stepsize  sequences,  i.e.  if 

(1.9)   hn  =  h0  •  xn  , 

the  polynomials   pn,on  do  not  depend  on  n   (although 
they  depend  on  x ) .  In  this  case  pn(E)  and  on(E) 
commute.  The  following  results  are  proved  very 
much  like  the  special  case  in  Dahlquist  (1975a). 

PROPOSITION  1.    Linear  identity   for  arbitrary   step- 
size  sequences.  For  a  linear  autonomous  system  the 
one-leg  method  (1.6)  gives  the  same  difference 
equation  as  the  linear  multistep  method, 

J. 10)  Pn(E)yn  =  hnan(E)f(yn)  . 

Non-linear  equivalence  for  geometric  stepsize 
sequences  defined  by  (1.9).  Assume  that  {yn} 
satisfies  the  one-leg  difference  equation, 

(1.11)  p(E)yn  =  vn  , 

and  let  y^   =  o(E)yn  .  Then 

(1.12)  p(E)yn  =  a(E)vn. 

If  v  =h  f(y  )  we  can  write  (1.12)  in  either  of 

n   n   n 
the  forms , 

(1.13)  p(E)?n  =  a(E)(hnf(yn))  , 

(1.13')  p(E)yn  =  hna(tE)f(yn)  . 

Conversely,  assume  that  for  n*0,  yn  satisfies 
(1.12).  Let  Pm,  ^  be  polynomials  of  degree  less 
than  k,  such  that  for  some  m,  0<m<2k-1, 

(1.11*)  Pm(Oa(^)-Qm(?)p(?)  s  ™  . 

and  let 

<1'15)  yn  =  Pm(E)Vm-VE)vn-m'   n  *  m  ' 

Then  {y  }  is  independent  of  m,  and 

(1.16)  o(E)yn  =  yn,  p(E)yn  =  rn . 

In  particular,  for  vn  =  hnf(yn)  it  follows  that 
if  {yn}  satisfies  the  linear  multistep  equation 
ten  (yr)  satisfies  the  one-leg  equation 
(1.6),  and  vice  versa.  ■ 

■umiTS. 

be  that  (1.10)  and  (1.13)  are  different. 


2)  The  sequence  vn  is  introdui 
applications  to  per  :e  equal! 
needed  in  the  error  analj    , 

Dahlquist  (1975b). 

3 )  Eqn .  (1.15)  gives  f or  m  =  n ,  n  =  0 , 1 ,2 , . . . ,k-1 , 
initial  valuer,  for  the  above  mentioned  c 
implementation  of  a  linear  multistep  met- 
with  given  initial  values,  y j  ,  J  =  0,1,...  >■'"  ■ 

2.  The  local  error 

Let  y(t)  be  an  exact  solution  of  the  differential 
system  (  1  .  1  ) ,  and  define  the  local  error  of 
the  one-leg  method  (1.2')  by, 

(2.1)   e   =  h^p  (E)y(t  )-f(a  (E)y(t  )). 
n    n  n      n      n      " 

The  displacement  operator  E  operates  on  a  seque 

space, Eym 

Here 


(Ey)  =  ym+1,  (not  on  a  function  space" 


JQajnym+j 


Pn(E)^ 
and  similarly  for  an(E).  Introduce  the  operators 

L  ip(t  )  =  p  (E)tp(t  )-h  a    (E)«p'(t 


L  ,  S  , 
n   n 


[2.2 


n   n 


n 


n' 


S  cp(t  )  =  a  (E)jp(t  )-<p((J  (E)t  ) 


"iT"  "n'    "n"  '  '  '  "n'   '  t'n' 

S  is  a  kind  of  interpolation  error  operator.  Ther 


n 


(2.3: 


-f(an(E)y(tn))  =  h^Lny(tn)  +  Sny'(tn) 
-f.Sn(y(tn))  +  0(HSny(tn)l|2). 


For  a  linear  multistep  method,  the  first  term  onl; 
is  present.  We  observe  that  we  must  put  constraint 
on  the  a-polynomial  in  order  to  obtain  the  same 
order  of  accuracy  for  the  one-leg  method _ as  for 
the  linear  multistep  method.  Choose  a  suitable 
reference  point  t*  for  Taylor  expansions  and  put, 

t   .  =t*+x.  h 

n+j   n   jn  n 

(2.U)         k  k     . 

))=.^%1n'  V°)=^n1n'^° 


qn 


(M  stands  for  "moment").  Then, 

Pn(E)yn=^oaJny(W: 
k 


I   a.  I    (T.  h  )q  y(q)(t*)/ql= 


j=(Hnq=0'  J 

=  7  hqM   (p)y(q)(t*)/q: 
'  n  qn         n 
q=0 

This  calculation  can  easily  be  modified_to  cover 
also  the  case  of  non-analytic  y(t).  Similarly, _ a 
straight-forward  calculation  yields,  if  we  assigr 
an  arbitrary  finite  value  to  M_ln(o)  , 

nqy(q)(t*)/q 
n      n 


IM   (p)-qMn  .  Jo] 
qn      q-1  ,n 


(2-5)Lny(tn)=JJ 

and,    if  we  note  that  on(E)tn = t* + \M^Jo )  , 

(2.6)  Sn(p(tn).lfM     (a)-M^(a)]hqcp(q)(t*)/qi. 


q=0' 


The  following  result  is  then  easily  obtained. 
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PROPOSITION  2.    The  order  of  accuracy  of  the  linear 
;'cep  method  (on,on)   is  the  largest  integer  p, 
such  that 


. 


H  ic) 
qn 


«*Vl.n(o)l 


,1,2.. 


The  order  of  accuracy  of  the  corresponding  one- 
leg  method   (on,on)  is  mi-n(p,r),  where  r  is  the 
largest  integer  for  which  the  following  equations 
hold, 


. 


Mo-1  ^0) 

q-i  ,n 


v 


'r. 


(a)  , 


1,: 


,r 


The  asymptotic  form  (when  h-»0 )  for  e_  is, 
(2.9)en~[M_ 


p+1,n 


■(p*DMpn(a)]hP 


n 
r(p+D 


/(p+1 


[V'Kn(4l(yl 


r+1 


V'"' 


The  derivatives  are  evaluated  at  t*,  and  f  is  the 

n      y 
Jacobian,  evaluated  at  y(t*).  Note  that  the  second 

,        .        n 
term  in  (2.9)  vanishes  when  the  system  is  linear 

and  autonomous,  as  it  should,  due  to  the  "linear 

identity  property",  see  Prop.1. 

Also  note  that  for  q=  1,  (2.8)  becomes  the 
normalization  convention  o  (1)=  1,  and  that  (2.8) 
is  trivial  for  q=2.  There  are  special  constraints 
for  one-leg  methods  for  r>  3  only. 

The  relations  (2.8)  were  first  obtained  by 
Liniger  (1978),  for  constant  stepsize,  after  that 
Warming  and  Dahlquist  had  realized  that  in  general 
the  order  will  not  exceed  2  for  one-leg  methods. 

In  view  of  the  non-linear  equivalence  (for 
constant  stepsize)  it  may  seem  paradoxical  that 
these  methods  do  not  have  the  same  "order  of 
However,  Prop.1  can  only 


accuracy 

prove  that  a(E)y 

follow  that  y 


be  used  to 
0(hv).    It  does  not 


y(o(E)t  ) 

n  „/nrv\     ■   • 

-y(t  )  =  O(hP).  Similar  observations 
n     n 
have  been  made  for  other  numerical  methods,  see 

Butcher  (1973). 


3.  Stability  and  contractivity 

Consider  the  application  of  the  method  (p,o)  with 
constant  step  h  to  the  test  equation,  y'  =  Xy,  and 
set  Yp  =  (yn»yn+l.....y5^.1)T.  The  difference 
equation  can  then  be  written  in  vector-matrix 


(3-D 


n+1 


A(Xh)Y 


where  A(Xh)  is  a  k*k  companion  matrix. 
We  recall  that  the  stability  set   is 

S  =  jxheC:  {An(Xh)}"  is  bounded!. 

The  contractivity  set   K,  Nevanlinna  and  Liniger 
(1978),  with  respect  to  a  given  norm  11*11  in  Ck 
is  {XhCC:  HA(Xh)ll  <  1}  .  *)  Obviously  KcS.  We 
use  terms  like  A-contractivity  etc.  in  obvious 
analogy  with  A-stability  etc. 
We  define  the  G~norm   of  a  vector, 


lv||2  =  v 


gv,  v  e  <r 


Let  K  be  the  corresponding  contractivity  set.  By 
the  theorem  of  Stein,  there  exists,  for  every  fixed 
Xh€S  a  matrix  G,  such  that  HA(Xh)IL  i.    1.  For  most 


r.  this  paper  a  matrix  norm  is  subordinate  to 
the  given  vector  norm. 


methods  there  is  no  G  that  can  be  used  in  the  whole 
of  S,  but  there  exists  a  useful  substitute. 

We  call  a  set  in  C  circularly  bounded 
the  closure  of  either  a  disk  or  the  complement  of  a 
disk  or  a  half-plane.  It  can  also  be  void  or  a  point . 

PROPOSITION  3.    (Dahlquist  1979,1978).  KG  is  cir- 
cularly bounded.  It  is  symmetric  with  respect  to 
the  real  axis,  if  G  is  real  and  symmetric.  To  any 
circularly  bounded  CcS  there  exists  a  G-norm,  such 
that  K  >  C. 

u  ■ 

A  corollary  is  that  A-stability  implies  A-con- 
tractivity for  some  G-norm  (a  property  previously 
called  G-stability ) .  Unfortunately,  it  also  follows 
that  for  a  method  that  is  not  A-stable,  one  cannot 
find  a  G  such  that  Kq  contains  both  0  and  °°.  For 
other  norms  K  is  usually  not  circularly  bounded. 

When  a  one-leg  method  is  applied  to  the  equation 
y'  =  X(t)y,  the  difference  is  still  of  the  form 
(3.1),  and  if  X(t)h€K  for  all  t,  then 
llA(Xtn)ll<  1  ,  Vn.  For  a  linear  multistep  method,  the 
difference  equation  becomes  more  complicated.  The 
matrix  A  depends  on  the  values  of  X  at  several 
points.  This  example  indicates  both  why  one-leg 
methods  may  be  easier  to  analyse  than  linear  mul- 
tistep methods  in  more  general  situations,  and  why 
contractivity  is  easier  to  generalize  than  plain 
stability. 

For  G-~norms ,  there  is  a  straight-forward  gener- 
alization to  nonlinear  systems  with  a  certain 
monotonicity  property   related  to  Kq,  Dahlquist 
( 1975b,  1978).  The  theory  is  given  for  constant 
stepsize  only,  but  the  extension  to  arbitrary  step- 
size  sequences  offers  no  difficulty,  provided  that 
there  is  a  G  that  can  be  used  for  all  occuring 
(p  ,a  ).  We  shall  see  in  Section  k   that  there  exist 
one-leg  2-step  methods  with  this  property.  Methods, 
which  do  not  posess  this  property,  may  be  open  for 
an  analysis,  based  on  estimation  of  the  rate  of 
change  of  G. 

For  maximum   norms ,  Nevanlinna  and  Liniger 
(1978,1979)  have  obtained  very  interesting  results . 
In  non-linear  problems  they  assume  a  kind  of  diag- 
onal dominance  for  the  Jacobian.  Their  analysis 
is  normally  restricted  to  a  smaller  class  of 
methods  than  the  G-norm  theory,  but  in  some 
respects  the  maximum  norm  theory  is  stronger,  e.g. 
in  connexion  with  Ag-contractivity.  See  also  the 
end  of  Section  h. 


4.  A-contractivity  of  two-step  methods  for 
arbitrary  stepsize  sequences 

Let 


PnU) 

anU)  = 


a2n?2  +  a1n?  +  a0n' 


2-cC       "In"'  "On 
Assume  that  the  method  is  A-stable  for  each  n,i.e. 


5  >1 


Rea  (r.)/p  (C)  >  0,  Vn  . 
n     n 


For  brevity  we  drop  n  in  the  subscripts  of  the 
coefficients.  It  must  be  remembered,  however,  that 
they  depend  on  h  ,p/n  +  -,• 

LEMMA   1.    The  consistent,  normalized  A-stable  two- 
step  methods  can  be  expressed  in  terms  of  three 
non-negative  parameters,  a,b,c, 


2a2  =  c  +  1 
(1*.1)  2d!  =  -2c 
2a0  =  c  -  1 


1*62  =  1  +  b  +  (a+c)  , 

l*6i  =  2(l-b) 

UBq  =  1  +  b  -  (a+c). 


1-3 


. 


V    =    1 


26!,    a+  c   =  2(8?  -  60). 


Formation 
=      .-+  -1).    Then  oU)/pU)   can  be   ex- 

■   ■  i 


■ 


z+c 


L ,  1  ,  C  >  0  . 


=  "  +  0/(5-0  and  match  coefficients, 
the  normalization  (1.8). 
We       iow  use  the  notations  and  results  of 
2  to  express  second  order  accuracy.    It  is 
enient  to  put  t£  =  tn+i  and  express  stepsize 

rms  of  the  parameter,  a,  defined  thus, 

:.  =  o(t2  +  t0)/2  . 

a >  0  means  increase  of  stepsize. 

We  substitute  this  and  (U.1)  into  (2.7).  It  is 
:'icient  to  consider  q=  1  and  q=2.  We  obtain 
r  some  computation: 

t  a(1-o)c  *  D.  The  method  defined  by 
able  and  second-order  accurate,  iff 

—  +  —  =1-c2,    a>0,  b>0,  c>0. 
a    1-a 

=   the  condition  reads  a=0.  If  a  =  1  and  c>0 

the  condition  reads  b=0.  If  c  =  0  then 

(l-b)(T2+T0)  =  2a,  x2-  t0  =  2. 

■ 

In  order  to  facilitate  the  comparison  with  a  more 
familiar  parameterization,  we  mention  the  following 
relations  between  (hn+2,hn+1 )  and  (a,hn),  which 
follow  from  (U.2)  and  (2.7), 


■ 


n+2 


+  1 


=  T2hn 


-t„  h 
0  n 


1  -  a  +  a/c  )h 


(1 


a/c  )h 


It  can  be  shown,  Dahlquist  (1979),  that  the 
only  possible  G-matrices  for  the  pair  (pn,on)  de- 
fined by  (H.1)  are  of  the  form  (apart  from  a  posi- 
tive multiplier), 


(It. 5) 


G  = 


g 


1+g-2c 
1-g 


1-g 
1+g+2c 


ac)2  <  Uabc 


triable  stepsize  method  is  characterized  by  the 
triplet  (a(o),  b(a),  c(a)).  We  want  G  to  be  the 
'or  all  a,  apart  from  a  scalar  multiplier. 
.lows  that  g  and  c  are  independent  of  a, 
hence  by  (b.1),  pn  has  to  be  the  same  all  the  time. 
jne  that  c *  0.  If  we  require  b(a)  to  be  continu- 
en,  by  ( U . 3 )  a(a)/a  is  continuous  at 
>,ce  a(a)>0  everywhere,  we  conclude  that 
=  a'    =0,  and  hence 

a(0)  =  0,  b(0)  =  1  -  c2  . 

,g=1,  and  hence  the  only  possi- 

'2(1-c)  0 
0   2(1+c) 


■ 


]■ 


0  <  c  <  1 


raa  earlier  obtained  by  Nevanlinna 
I  by  the 


It  is  easily  verified  thai 
that.    .        with  g=  1. 

(I4.9)   (1  -  c2  -b(ct)  -        =  Ua(o)h  a)  ■  , 

and  we  summarize  our  n 

PROPOSITION  4.      For  any  constant  c,  0 < c <  1 , 
equations  (^.8)  and  (*t.l)  define  a  second-order 
accurate  one-leg  met-   ,     ;h  is  A-contract i 
with  respect  to  the  G-norm  (k.J).    The  parame+ 
a  and  h  are  given  by  (h.i>  1. 

There  is  a  geometric  idea  behind  the  method 
(I4.8).  In  the  plane  with  cartesian  coor-i 
(ac,b),  eqn.  ( U . 3 )  defines  a  one-parameter  family 
of  straight  lines,  the  envelope  of  which  is  given 
by  (h.8).    Then  (h.9)    shows  that  this  envelope  is 
exactly  the  parabolic  boundary  of  the  domain  de- 
fined by  the  second  inequality  (lt.6),  for  g=  '. 
This  coincidence,  which  we  cannot  yet  "explain", 
indicates  that  ( U . 8 )  gives  a  complete  account  of 
methods  that  satisfy  the  restrictions  formulated 
in  this  section.  (Alternative  methods  are  perhaps 
obtained  by  dropping  the  continuity  requirement 
for  b(oe).)  It  is  remarkable  that  the  BDF  method, 
for  which  c  =  2  when  a  =  0 ,  is  excluded. 

Nevanlinna  (1979)  has  somewhat  earlier  than  we 
found  the  same  family  (although  with  a  very 
different  parametrization)  and  shown  that  it  is 
A-contractive  with  respect  to  the  maximumnorm , 
for  arbitrary  stepsize  sequences.  His  result  is 
more  flexible  than  ours,  since  it  allows  pn(?)  to 
be  changed  during  the  computations,  which  we 
cannot  do.  On  the  other  hand  our  result  covers 
(at  the  time  of  writing)  a  different  class  of 
non-linear  problems. 

The  analysis  can  easily  be  extended  to  the  case 
where  the  stability  region  is  assumed  to  include 
a  circle  tangent  to  the  imaginary  axis  at  the 
origin.  We  have  done  some  calculations  concerning 
the  unique  stable  method  with  k  =  2,  p=  3  and  some 
explicit  methods  with  p  =  k =  2 ,  but  we  must  say 
like  Fermat:  the  paper  isn't  large  enough. 
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In  this  paper  a  new  stability  criterion  for  discretization  methods  for  stiff  ordinary  differential 
equations  is  proposed. This  new  stability  concept  takes  into  account  the  variation  in  time  of  the  eigen- 
vectors of  the  Jacobian  matrix  of  the  system. As  such  the  new  criterion  complements  the  existing  criteria; 
it  does  not  replace  them. The  new  criterion  is  applied  to  multistep  methods, a  generalized  Runge-Kutta 
method. and  Galerkin  methods  (  with  numerical  quadrature) .We  refer  to  the  new  stability  criterion  as 
D-stability  because  of  the  relationship  with  the  decomposition  error. 


In  the  definition  of  D-stability  we  consider  linear  homogeneous  systems  of  the  type 

(1)  x'  *  A(t)x, 

where  fait)    is  a  matrix  and  x' ■  dx/dt.  The  matrix  Att)  may  be  obtained  as  the  Jacobian  matrix  [evaluated 
along  a  smooth  solution)  related  to  a  non-linear  system.  We  restrict  ourselves  to  discretization  methods 
which  discretize  the  system  (1)  by  the  recursion 


(2) 


y .  ,  =  G(t. ,h)y. 
*i+1      l    *i 


wt-ere  h  is  the  stepsize.t  =t  +ih,  and  where  G(t.,h)  is  a  matrix. If  the  discretization  method  is  a  one-step 

l   0  i 

method, then  y  is  the  approximation  to  x(t,).  If  the  discretization  is  a  multistep  method, then  the 
recursion  (2)  is  the  formal  one-step  recursion  corresponding  to  the  multistep  recursion  and  the  meaning 
of  y.has  to  be  changed  accordingly. 

For  stiff  systems  it  is  very  difficult  to  obtain  stability  for  practical  values  of  the  stepsize  h.  In 
order  to  obtain  valuable  information  about  discretization  methods  for  stiff  systems  for  realistic  values 
of  the  stepsize  G. G.Dahlquist  restricted  himself  to  the  scalar  model  equation  x'  =  Ax  by  introducing  his 
concept  of  A-stability ,cf .[2].  This  concept  has  been  modified  in  various  ways,  but  in  all  modifications 
the  time- dependence  of  the  eigenvectors  of  the  Jacobian  matrix  is  neglected. However, the  time-dependence 
of  the  eigenvectors  of  the  Jacobian  matrix  cannot  always  be  neglected  in  the  analysis  of  stiff  discretized 
problems. The  trapezoidal  rule  provides  an  example.  For  this  method  the  source  of  the  difficulties  is  the 
bad  decomposition  error.  As  such  the  difficulty  was  known  to  Dahlquist  and  Lindbergh]  and  Van  Veldhuizen 
[7] .H'^wever,  no  simple  criterion  for  investigating  such  phenomena  in  other  methods  seems  to  exist.  In  this 
paper  we  propose  the  concept  of  D-stability.  We  define  it  in  such  a  way  that  a  D-stable  method  does  not 
suffer  from  a  bad  decomposition  error  for  a  large  class  of  stiff  problems. 

Definition.  The  discretization  method  resulting  in  the  recursion  (2)  is  called  D-stable  if  for  all  t  in  the 
interval, all  h  £  ^°'n0^  and   a11  stiff  systems  of  a  class  3   (  see  below)  ||G(t,h)||  <  M,  M  a  constant 
independent  of  h  and  the  system  in  the  class  o< 

Consequently , D-stability  is  a  weak  concept  as  only  the  transition  in  one  step  should  be  bounded  uniformly 
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ir  the  stepsize  h.  On  the  other  hand,  D-stability  is  a  strong  concept  by  requiring  this  uniformly  in  a 
class  of  stiff  systems.  Obviously,  the  concept  of  D-stability  complements,  but  not  replaces,  the  concept 
Df  --stability  and  related  ones. Now  we  come  to  the  description  of  the  class  o   of  stiff  systems.  This  class 
jj  s-ould  be  sufficiently  large  because  otherwise  the  concept  op  D-stability  becomes  a  triviality.  On  the 
other  hand,  we  aim  at  a  simple  criterion  and  so  we  must  be  restrictive  in  the  choice  of  the  class  O  .  The 
class  J  as  described  below  seems  adequate. 

Class  0  .  The  class  S   consists  of  all  linear  systems  of  the  type  x*  =  A[t)x  satisfying  the  conditions: 
(51]   x(t)  e  £2  . 

(52)  A(t)  =  T(t)A(t)T(tr1   for  all  t.  Here  A(t)  is  a  diagonal  matrix,  A(t)  =  diag{  A(t)/e,  ptt]  }.  Also 

ReC  Alt))  <  \   <  0,and  e  £(0,eQ]. 

- 1 

(53)  X,    u,  T  and  T   depend  smoothly  on  t  and  e,  and  the  derivatives  from  order  zero  up  to  a  certain 

order  (  sufficiently  high  )  are  bounded  uniformly  in  t  and  e  e  (  0  ,  £_]. 

Obviously , the  systems  in  the  class  o   are  stiff.  In  many  cases  it  is  realistic  to  consider  a  subset  of  the 
class  J  .  E.g.  one  might  consider  systems  with  real  eigenvalues  only.  Also,  it  is  possible  to  restrict  the 
class  o   to  the  set  of  stiff  systems  with  a  particular  type  of  coupling  (strong, weak]  between  the  smooth 
component  and  the  transient  one.  In  this  survey  we  do  not  go  into  these  details. 

The  stiffness  is  introduced  in  the  class  O  by  the  eigenvalue  X(t)/e.  We  must  have  e  e  (0,£  ]  in  the  class 
O  because  otherwise  the  definition  of  D-stability  becomes  a  trivial  one.  We  now  describe  some  of  the 
results;we  look  at  three  classes  of  discretization  methods. 


Multistep  Methods.  A  multistep  method  defined  by  the  generating  polynomials  p  and  a  of  degree  £  k  i6 
D-stable  if  and  only  if  cr(z)  »  Bnz  .  This  result  once  more  justifies  the  choice  of  the  backward 
differentiation  methods  in  the  code  by  Gear[5].  However,  the  one-leg  method  (see  Dahlquist[3] ) 
corresponding  to  the  multistep  method  is  D-stable  if  the  method  is  A  -stable. This  result  emphasizes  the 
importance  of  one-leg  methods  as  introduced  by  Dahlquist[3] . 

leralized  Runge-Kutta  Methods.  Verwer[8]  introduces  the  concept  of  internal  S-stability.  Roughly  speaking, 
a  generalized  Runge-Kutta  method  is  internally  S-stable  if  it  has  good  damping  properties  at  each  stage 
when  applied  to  the  model  x'  =  Xx.  It  turns  out  that  the  internally  S-stable  methods  of  Verwer[8]  are  not 
D-stable.  This  shows  that  D-stability  is  not  a  consequence  of  good  damping  properties. 

lethods .  Here  we  restrict  ourselves  to  Galerkin  methods  with  approximate  quadrature  as  considered 
among  others  )  by  Axelsson[l]  and  Weiss[9].  With  suitable  Gaussian  quadrature  or  Radau  quadrature  these 
methods  reduce  to  collocation  methods,  while  the  corresponding  method  with  Lobatto  quadrature  is  not  a 
collocation  method.  We  have  the  following  result:  all  A-9table  collocation  methods  are  O-stablejthe  method 
1  Lobatto  points  ia  not  D-stable.  This  result  is  important  in  view  of  the  use  of  such  methods  in 
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Stability  of  explicit  time  discretizations  for 
solving  initial  value  problems 
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D-4630  Bochum,  Fed.  Rep.  of  Germany 


Olavi  itevanlinna,  Department  of  Mathematics,  Oulu  University, 
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stability  regions  of  explicit  "linear"  time  dis- 
cretization methods  for  solving  initial  value 
problems  are  treated.  A  discretization  is  called 
"linear"  if  tne  right  nand  side  of  the  differen- 
tial equation,  its  total  derivatives  and  iterates 
of  these  occur  only  linearly.  Tnus  the  class  of 
methods  considered  includes  kunge-Kutta-methods, 
linear  multistep  methods,  multistep  methods 
using  higher  derivatives,  predictor-corrector 
methods  and  cyclic  methods.  If  an  integration 
metnod  needs  m  function  evaluation  per  time  step, 
then  we  scale  the  stability  region  by  dividing 
by  m,  and  show  that  for  any  two  methods,  satis- 
fying some  reasonable  conditions,  the  boundaries 
of  the  scaled  stability  regions  necessarily 
intersect.  Using  this  comparison  result  bounds 
for  the  size  of  the  stability  regions  for  three 
different  purposes  are  given:  I)  for  "general" 
nonlinear  ordinary  differential  systems  2)  for 
systems  obtained  from  parabolic  problems  and 
3)  for  systems  obtained  from  hyperbolic  problems. 

First  we  consider  the  size  of  disks  one  can 
have  inside  the  stability  region  with  origin  on  t 
their  boundary.  In  the  scaled  sense  the  largest 
disk  is  obtained  with  the  Luler's  method  and 
for  any  smaller  disk  there  exist  explicit  linear 
k-step  methods  for  order  p  =  k  which  are  stable 
in  such  a  disk. 


A-stable  methods  with  high  order. 

For  hyperbolic  problems  one  is  interested  in 
methods  with  long  stability  intervals  along  the 
imaginary  axis.  It  is  shown  that  in  the  scaled 
sense,  the  leap-frog  method  has  the  largest  inter- 
val and  that  for  any  shorter  interval  kS  {2,3,4} 
there  are  explicit  linear  k-step  methods  which  areJ 
stable  on  that  interval  and  of  order  p  =  k. however 
for  k  =  1  mod  4  an  explicit  linear  k-step  method 
with  p  =  k  has  no  nontrivial  interval  of  stability] 
on  the  imaginary  axis. 

By  the  previous  result  about  the  largest  stability:! 
disk  with  the  origin  on  its  boundary  it  follows 
that  the  radius  of  such  a  disk  is  bounded  by  m  fori 
predictor-corrector  methods,  where  the  corrector 
is  iterated  m  -  1  times  and  the  function  is 
evaluated  m  times.  It  is  shown  that  this  disk  can  j 
be  obtained  only  if  one  uses  different  correctors  j 
at  each  iteration  step.  When  one,  as  is  customary,: 
iterates  a  fixed  corrector,  then  as  m  tends  to 
infinity  the  maximal  radius  does  not  tend  to  in- 
finity, and  in  fact  it  is  shown  that  in  the  limit  ; 
the  maximal  radius  is  2. 


For  parabolic  problems  one  is  interested  in 
methods  which  are  stable  for  long  intervals  of 
the  form  |  -a,o]  ,   a  >  o    .  It  is  shown  that  for 
explicit  linear  multistep  methods  of  order  p  =  k 
lac  length  of  the  interval  is  always  bounded  by  2, 
but  already  for  p  ■  k  -  1  one  can  construct 
methods  which  are  stable  in  bounded  sectorial 
sets  for  arbitrarily  big  radius  and  any  opening 
less  than  n  .  hence,  although  an  explicit  method 
cannot  be  A-stable  we  can  have  approximately 
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STABILITY  PROPERTIES  FOR  A  GENERAL  CLASS  OF  METHODS 
FOR  ORDINARY  DIFFERENTIAL  EQUATIONS 


J.C.  Butcher, 

University  of  Auckland, 

Auckland,  New  Zealand 


Abstract   The  concepts  of  G-stability  for  linear  multistep 
methods  and  B-stability  for  Runge-Kutta  methods  are  combined 
in  a  unified  approach  to  non-linear  stability  for  a  class  of 
methods  general  enough  to  include  these  as  special  cases. 
Recent  work  on  these  developments,  by  Kevin  Burrage  and  the 
author,  is  reviewed  and,  to  exemplify  the  new  ideas  and 
techniques,  an  algebraic  stability  analysis  is  presented 
for  the  backward  differentiation  methods  of  orders  1,  2 
and  3  • 


Introduction    The  property  of 
A-stability,  long  accepted  as  a 
desirable  condition  for  methods  to 
possess  if  they  are  to  be  used  with 
stiff  problems,  gives  only  limited 
guidance  to  the  numerical  behaviour  if 
these  problems  are  also  non-linear.   For 
this  reason,  there  has  grown  up  an 
interest  in  non-linear  problems  of  the 
form   y'(x)  =  f(y(x)),   where   y   has 
values  in  an  inner-product  space   X, 
which  are  such  that 

<u-v,f (u)-f (v)>  <  0, 

for  all   u,v  6  X  .   For  such  a  problem, 
an  immediate  consequence  is  that  for  two 
particular  solutions   y   and   z, 
By(x)-z(x)    is  a  non-increasing  function 


of   x  .   It  is  interesting  to  know 

whether  a  particular  numerical  method 

exhibits  similar  behaviour  in  the  results 

it  generates. 

This  is  simply  expressed  for  one 

step  methods  as  the  requirement  that  for 

two  solution  sequences   . . . ,y   ..  ,y  ,... 

and   ...,z   ,,z  ,...,   it  holds  that 
•  n-1*  n*    ' 

Hyn"zn"  -  ,lyn-l~zn-l"'   For' multistep 
methods,  in  which   r   vectors  in   X   are 
used  to  represent  the  solution 
information  at  the  end  of  a  step,  it  is 
necessary  to  construct  a  norm  on   X 
using  the  tensor  product  of  the  inner- 
product  on   X   that  comes  with  the 
problem  and  a  new  inner-product  on   1R 
that  depends  on  the  method.   Bounds  on 
the  behaviour  from  step  to  step  can  then 
be  expressed  in  terms  of  the  norm 
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formed  froa  the  tensor  product. 

tf   G   La  the  matrix  of  the  Lnner- 
producl  i  n   IK  ,   which  i  a  character- 
Lei  Lc  of  the  method |  the  corresponding 
stability  propci  t  \  i -.  known  as 

i itabilitj  .Hid  \>a^  firsl   Used  by 

G.  Dahlquisl   S  .   The  corresponding 
propertj  ol  Runge-Kutta  methods  was 
i  i  i  —  t  proposed  i>\  the  present  author  [4l 
.uui ,  i  ii  collaboration  with  K<-\  i  n  Burrage, 
waa  further  developed  [1]  and  combined 
with  tbe  ideas  of  G-stabilit>  to  g i  \  e 
the  algebraic  stability  property  for 
genera]  linear  methods  [2].   This  paper 
i-^  pai  tly  .i  re\  Lew  of  the  joint  work  by 
Di  .  Bui  i  age  and  the  author,  and  partly  a 
study j  t'rom  the  viewpoint  of  algebraic 
Stability ,  of  three  special  methods. 

Be  methods,  the  backward  differen- 
t  Lai  ion  methods  of  orders  1,  2  and  3, 
are  important  components  of  many 
algorithms  in  present  use  for  solving 
stiff  problems. 

Part  itioned  general  linear  methods. 
Let   \.  ,   be  an   sxr   matrix,   A    an 


B1  .   an   sxs   matrix   and 


rvr   matriXj 
B     an   rvs   matrix.      A  partitioned 
general  linear  method  is  a  general  linear 
method  (or  (A,B)  method)  [3],  such  that 
V   and   B   are  each   (s+r)x(s+r) 
matrices  partitioned  as  follows 


0   A 


1  2 


0   A 


22 


B  = 


Bll   ° 


B21   ° 


where,  of  course,  the  various   0 
matrices  are  not  necessarily  the  same 
si/es.   We  will,  for  convenience,  denote 
such  a  method  by  writing  the  non- 
trivial  parts  of  its  coefficient 
matrices  together  in  the  format 


12 


22 


15 


11 


21 


For     such    a    met  hod,     when     USed     to 

solve  the  equation   \'(\)    i(\(\)) 
with  step  size   Ii,   the  a+r   vectors 
which  represent  al]  the  quantities 
computed  in  step   n,   saj 

(n)  „(n)      Y(n)   (n)   (n)       (n ) 
]   '2   "••'\s   'yi   'y2   '•••'', 

are  related  to  t he  previous  step  by 
(2) 


Y(n)  =   Ja(H)  (n-1)  + 


j=l  ^ 


h  J  b(ll)f(Y(n)),   i=l,2,...,s, 
j=l  iJ     J 


(n)     '   (22)  (n-1) 
y     =   V  a     y     '    + 

i      j=l  ij    j 


h  rb(2l)f(Y(n)),   i=l,2,...,r, 
j=l  ij      J 


.        (12)    (22)   .(11)   .(21) 
where   av    ,  a     .  b     ,  bv      are 


(3) 


1J         !J 

typical  elements  in 


ij 


A12>  A22>  Bll 


21 


respectively. 


Y(n\Y(n) 


.(n) 


12         s 
can  be  interpreted  as   s   stages,   as  in 

a  Runge-Kutta  method.   We  see  from  (2) 

and  (3)  that  no  use  in  later  steps  is 

made  of  these  quantities  once  the  step  is 

completed.   On  the  other  hand, 

(n)   (n)       (n)  .  +u 

y    ,y    ,...,y      represent  the   r 

12         r 
quantities  needed  at  the  end  of  a  step  to 

allow  the  computation  of  the  next  step. 

Although  we  have  used  the  symbol 

y      for  one  of  these  pieces  of 

i 
information,  it  is  not  necessarily  an 

approximation  to  the  solution  value  near 

the  n    integration  point   x   .   Rather, 

it  should  be  thought  of  as  an 

approximation  to   u.  v(x  )+hv.  y  (x  ) 

where   u.,v.   are  numbers  characteristic 

of  the  method. 

The  special  case   r=l ,  with   A 

equal  to  1,  gives   s   stage  Runge-Kutta 

methods,  whereas 
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12 


22 


11 


21 


*1 

2 

•    °k 

Pi 

H 

Pk 

Po 

°1 

o2    .. 

•    °k 

h 

p2 

Pk 

Po 

1 

0       .  . 

.       0 

0 

0 

... 

0 

0 

• 

1 

. 

• 

. 

. 

. 

0 

0       .  . 

1      0 

0 

0 

0 

0 

0 

0       .  . 

.       0 

0 

0 

0 

1 

0 

0       .  . 

.       0 

1 

0 

.  .  . 

0 

0 

• 

• 

• 

• 

1 

• 

• 

•- 

• 

• 

• 

• 

• 

0 

0       .  . 

0 

0 

0 

.  .1 

0 

0 

gives    the   k-step    method 


yn  =    °lyn-l    +    a2yn-2+'"+akyn-k 


+  h(30f(yn)  +  31f(yn_1)+...+3kf(yn_k))  , 


l •    l       v(n)  (n) 

xn     which      Yv        =  y  =  y„ , 

1  1  n 


(n) 


.(n) 


'n-l: 


=  y. 


n-k+l , 


2 
yk+l=hf(yn>>    y(Zl=    M^n-1>- 


(n)        LC/ 

y     =  hf(y 
2k 


n-k+l 


k+2 
). 


A  non-linear  test  problem.   Let 
(X, <•,•>)   denote  a  real  pseudo  inner- 
product  space.   We  recall  that  an  inner- 
product   <',#>   on   X   is  a  bilinear 
functional  such  that   <u,v>  =  <v,u>   for 
u,v  6  X   and  such  that  for  non-zero 
u  €  X,  <u,u>  >  0.   In  a  pseudo  inner- 
product  this  inequality  is  weakened  to 
-ii,u  -  >  0  .   Let   II  •  H   be  the 
corresponding  pseudo  norm  defined  by 
Hull  =  <u,\i>1^1      for  all   u  e  X  . 

Let   f  :  X  -.  X   be  continuous  and 
such  that,  lor  all   u  6  X 

2 


where   5   is  a  real  number.   A 
consequence  of  (4)  is  that  any  solution 
y   to  the  differential  equation 
y  (x)  =  f(y(x)),   is  bounded  in  pseudo 
norm  by  a  multiple  of   exp(i'x).   To 
verify  this,  we  need  only  differentiate 
Hexp(-?x)y(x)||2  =  exp(-2?x)<y(x),y(x)> 
to  obtain  a  zero  upper  bound  on  the 
derivative. 

This  rather  artificial  test 
problem  can  be  used  to  study  the 
behaviour  of  the  difference  of  two 
solutions,  as  in  the  G-stability  theory, 
even  when  the  differential  equation  under 
study  is  non-autonomous. 

If   G   is  a  non-zero  non-negative 
symmetric   rxr   matrix,  then  we  can 
define  a  pseudo  inner-product  on   X 


using  the  elements   g.  .  (i,j=l,2, 


,r) 


of   G,   and  an  existing  pseudo  inner- 
product  on   X.   Denoting  the  pseudo 


inner-product  on   X 
define  i 

v  =  (v1,v2,... 

<u,v>G 


by  <-,->G, 

define  its  value  for   u  =  (u, ,u„, 

>vr)    by 


>ur)» 


i,J=l 


g .  .  <u  .  ,  v  .  > 


(4) 


<u,l  (u)   '-    '||u| 


We  will  use  the  pseudo  norm   H'llf  > 

defined  from  <•,•>„,      as  the  measure  of 

r    •       • 
the  size  of  a  vector  in   X    with  a  view 

to  obtaining  bounds  on  the  behaviour  of 

/(n)(n)       (nK  ,    .    ,-,-> 

(y    ,y    ,...,y    ),   computed  using  (1), 
12         r 

in  terms  of  the  behaviour  at  the 
previous  step.   We  will  attempt  to  find 
bounds  that  correspond  in  some  way  to 
bounds  on  the  behaviour  of  the  solution 
to  the  underlying  problem. 

Algebraic  stability  and  its  general- 
izations.  A  partitioned  general  linear 
method  (l)  is  said  to  be   (k,/.) 
algebraically  stable  for   k   a  positive 
number  and  I      an  arbitrary  number,  if 
there  is  a  non-zero  non-negative  symmetri; 
matrix   G,   and  a  non-negative  diagonal 
matrix   D,   such  that  the  (symmetric) 
matrix 
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M 


I  T 

42D~A22GB2] 


"2*A12DA12 


•2''A12DB1] 


DAI2~B21GA22      DB11+B11D~B21GB2] 

T  T 

-2tBllDA12  "MB11DB13 


i  -^    non-negat  Lve, 

It     is    a    principal    result    in    Burrage 
and    Butcher    [2],    that    if    a   method 
satisfies    this    condition    and    if    a    problem 
satisfies    (4)    with       >    =    h  » ,       then 


(>1        ,y2 


(n)x 

,y         )      can   be 


(pseudo-)    estimated    by 


'G   = 

ftfe  are  interested  in  obtaining  for  a 
method  a  (k,-?,)  algebraic  stability  result 
in  which   k   and  i,      are  functionally 
related  by  to(t)    =  k  '   .   To  the  extent 
that   tp   is  a  good  approximation  to  the 
exponential  function,  we  obtain 
behaviour  for  the  computed  results  that 
is  a  good  model  for  the  behaviour  of  the 
exact  result. 

In  particular,  if  a  method  is  (1,0) 
algebraically  stable  it  will  be  called, 
simply,  algebraically  stable.   In 
addition  to  this  non-linear  generaliz- 
ation of  A-stability,  we  are  also 
interested  in  knowing  whether,  given 
k   as  an  arbitrarily  small  positive 
number,   (k,/, )   algebraic  stability  can 
be  obtained  for  sufficiently  small  I 
(generalizing  the  "behaviour  at 
infinity"  aspect  of  L-stability ) ,  and 
whether  we  can  obtain   (l,i.)   stability 
for  sufficiently  small   I   (generalizing 
part  of  the  definition  of  stiff 
stability). 


Finally,  Ln  t  h  i  s  section,  we  note 

that,  Lf   Ik  ,   Ls  non-singular  ( as  Ls 

typicallj  the  case  Ln  methods  suitable 

for  s1  iii  problems),  .i  congruence 

trans formal  Lon  can  be  app]  i cd  to   M 

given  by  (S),  to  yield  a  symmetric  matrix 

<>i   i  simpler  form  for  many  methods  .   Ii 
-1 

V,,  =  V,.,  -  ^oiBll  12   a,icl  ' ' '*'  transforming 
matrix  is 


I       0 
"Bl'A12    T 


then       M   =    P    MP       is    given    by 


;6) 


M   = 


kG-A22GA22 


T       „ 


-A12D    A22GB21 


■DA12~B21GA22  DB11+B11D-B21GB21 


-2£BnDBii 


Where   M   takes  on  a  simpler  form  than   M, 
we  would  use  its  non-negativity  as  a 
criterion  for   (k,/, )   algebraic  stability. 

Application  to  backward  differentiation 
methods .   The  backward  differentiation 
methods  of  orders  1,  2  and  3  respectively, 
given  by 


(7)  y   =  y   ,  +  hf(y  ), 
Jn    Jn-1       J n    ' 


(8)  yn  =  4yn-l  "  Tn-2  +lhf(yn}' 

(9)  yn  =  TTY^!-  ifyn-2+  ITyn-3+  TThf<yn)« 


are  the  first  three  members  of  a  sequence 
of  important  methods  for  stiff  problems. 
The  order   1   and   2   methods  are  known 
to  be  A-stable  whereas  the  3rd  order 
method  given  by  (9)  has  a  stability  region 
that  includes  all  points  in  the  complex 
plane  with  real  parts  less  than  -112  . 
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As  partitioned  general  linear  methods  they 
are  given  respectively  by 


4 

-1 

2 

3 

3 

3 

4 

-1 

2 

3 

3 

3 

1 

0 

0 

and 


18 
11 

-9 
11 

2 
11 

6 
11 

lH 
n 

-9 
11 

2 
11 

6 
n 

l 

0 

0 

0 

0 

1 

0 

0 

their  M  matrices  (as  given  by  (16)), 
with  D  set  arbitrarily  equal  to  1  in 
each  case,  are 


Theorem  2.   The  second  order  backward 
differentiation  method  is   (k,/, ) 
algebraically  stable  for   (k,.f, )   related 
so  that 

(10)  I    =  \   -   -£    ,  0<k<l  , 

(11)  K    =  |    -    2k"1/2    +  ik"1    ,    k>l      , 
or,    what    is    equivalent, 

k  =    1/(1-2/,)     ,       t<0    , 


kg 


11 


-1 


kgll-g22 


-1 


2-gi:L-24 


-4-4 
'12         3    3^12 


kg12 
•3    3e12 


kg 

1 

3 


22 


1 

3 

!— i 

3   9feH    9 


7tl 


and 


kgll"g22        kg12"g23      kg13 
kg12-g23        kg22-g33      kg23 


kg 


13 


kg 


23 


kg 


33 


-18  _6      _9  _6 
ll"llg12   ll"llg13 


-2 
11 


JJ3  _6 
ll"llg12 


ll"llg13 

22 
11 

11  121811 

— 72i 
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Not  surprisingly,  determination  of 
which   (k,/,)   pairs  give   (k,/,)   algebraic 
stability,  is  trivial  for  the  method 
niven  by  (7)  and  progressively  more 
complicated  for  (8)  and  (9)- 

Ih f;orem  1  .   The  first  order  backward 
differentiation  method  is   (  k  ,  J, ) 
il '^nbraically  stable  whenever 


k1/2  =  i/(2-(i+2t)1/2)  ,  t>q  . 


Proof.   We  will  show  that,  with  t 
related  to   k   as  in  (10)  and  (11), 

gll,g12,g22   may  be  cnosen  so  tnat   M 
is  of  rank  1  with  non-negative  diagonal 


elements , 


Let   a  =  kg 


12> 


3  =  kg22   so 


that   M   being  of  rank  1  would  require 
that  g1T   =  a2/pk  +  p/k2,  g12  =  a/k  , 
grt^  =  P/k   (implying  that   G   is  positive 
if   3^0)  and  furthermore  that 

(12)  a  +  3 (4+2 a/k)  =  0  , 

(13)  (12-8-03  =  4a2/k  +  432  'k  +  1. 

Since  the  values  of  I      given  in  (10) 
and  (11 )  satisfy  the  inequality 
12  -  8.1  >  0,   any  real  solution  to  (12) 
and  (13)  will  necessarily  have   3>0 
(because  the  right  hand  side  of  (13)  is 
positive).   Hence,  it  remains  to  show 
that,  with  the  given  value  of  t,       (12) 
and  (13)  do  in  fact  have  a  real  solution 
pair.   If   3   is  eliminated,  the 
resulting  equation  for   a  can  be  written 
in  the  form 

(14)  2(a+k)4  +  (a+k)2(k-k2-2/,k2) 

+  (k3-k4+2*,k4)  =  0 


Pi  00 f  . 


<  hoosf 


,-1/2 


;n 


If   k  <  1 ,   the  value  of  t      given  by 

(10)  is  satisfied  by   a  +  k  =  0.   If 

1    I,   the  value  of  t      given  by  (11 )  is 
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~»«n  h   aa    bo   sake    (\A)   •>   perfect    square 
.iiui    the   negativity    of   the   coefficient 

Di       ia+k)"      guarantees    the   existence   oi     a 

> 
posit  i  \  <•   root    for      (  a+k  )  "    . 


theorem 


Hie    third    order    back- 


ward  different  i.it  Lon   method   is 
ik,')      algebraically   stable    for 
(k,t  )      pel  .u  ed   so    I  tut 


(15)  4 


17 
U 

li 


4k 


-1^  -3)3/2 

24Vk      JJ        : 


°<"S# 


do)    t-^+*| 


^    2(3K+1),    k>$ 


Proof i        ^s    in    the    proof    of    theorem    2,    we 
\>  i  1  1     ->lu>\>    that       G      may   be    chosen    so    that 
M      is    of    rank    1    with    non-negative 
diagonals.       Let       a  =  kg-i-i»    P  =   kg,!) 
■y   =    ku ■  , ,      so    that 


•33 


G   = 


-2-  +  £—  +  -I 
k-        k2v       k^ 


°£  +  _J 


i: 


k-y 


k 


a 


i L44ft-108k )  , J' I l 44k    ■  ■ 63k3)-y 


■  < l 6k  * 6+1 2k*)        =   0, 


( 21  )       (129652-1944k6+720k2+144k  )  , Z 

-16mk4Y+(l44k352+216k4fi+8lk!'+16k4)=0 


Eliminating      y      from    (20)    and    (21 )     Leads 
to    an    equation    that    can    bo   writ  ten    as 

(22)  (a64+bfi2+c)2   -    df,2   =   0    . 

where      a  =    20736,    b   =   864k(8-3k), 

c   «   192k4m   -    63k3(l6+8lk), 

d   =    65536k4(km+27)2   -    2359296k3(l+9k) 2    . 

What   we   will    show    is    that    the   value   of      n 

resulting   from      -l      chosen   by    (IS)    and 

(16)    leads    to    a    repeated    non-negative 

2 
root    for       6         in    this    equation. 

In   the   case      k   >    64/49,       the    choice 

of      I      implies    that      d=0      and   that 

c   =   9k5/2(4-9kl/2)2(8-7kl/2)    <  0    . 

It  turns  out  V   at   k  <  64/49   would 

2 
not  allow  a  positive  root  for   6    to 

exist  if   d=0   so  we  consider  the 

condition  that  (22)  will  have  repeated 

roots  if   d  7^  0.   This  condition  is 


and,  writing   m  =  3 3— 1§ -t- ,   the  conditions 
on  the  last  row  and  column  of   M   for  it 
to  be  rank  1  are 


(17)   9k2Y  +  3kap  +  3Py  -  ok2  =  0  , 


H-)       9ky   +  2k p   -  6av 


=  0 


(10)   k3  +  9k2a2  +  9kp2  +  9y2   -  mk3y  =  0  , 


and  we  note  that,  since  the  values  of  I 
siven  by  (IS)  and  (16)   imply  that   m>0, 
the  positivity  of  y,       if  real  solution 
trios   (a,P,y)   to  (17),  (18 )   and  (19) 

are  shown  to  exist,  will  be  assured.   It 

3 

is  convenient  to  substitute   a  =  -rk  +  <j 

4 

and  to  eliminate   P   from  (17),  (18)  and 
(19)  to  obtain  the  following  equations 


( 23 )  27ad2+d( 4b3-144abc )-c ( 4b  2-l 6ac ) 2=0 

and  we  will  require  of  the  solution  we 
obtain  to  this  that,  in  addition,   d  %  0. 
With  our  values  of   a,b,c  and  d,   (23) 
can,  after  considerable  manipulation,  be 
tvritten  in  the  form 


(24)  (2m- 24  3+  -t^)2 

x((4m-8l-  M)2  _  32(4  _3)3)  _  0 


with  one  of  the  roots  given  by 


(25)   m  =  81/4  +  27/2k  +  |(4  -3)3/2 


which  leads  to  a  value  of  I      given  by 
(IS). 
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It  remains  to  prove  that,  with   m 
given  by  (25),   d   is  positive  or,  what  is 
equivalent,  that 


-21   +  6k"3  2(l+9k) 


for   0  <  k  <  64/49  . 

Multiplying  by  the  positive  quantity 
4k"  V3,   we  find  as  an  equivalent  require- 
ment that 
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(26)  27k3/2+5' 


/2-8(l+9k)+(4-3k)3/2>0 


Since      27k3 /2   +    54k1//2    -    8(l+9k) 

=    (3kl/2-4)(3kl/2-(2V"2))(3k1/2-(2-V"2)), 

(26)  is  clearly  satisfied  for 

k  £  [(2i/2")/3,(2+/2)/3]   and,  since  the 

left  hand  side  of  (26)  vanishes  when 

k=0   or   k  =  64/49,   it  will  be  sufficient 

to  prove  that  the  derivative  of  this 

expression  is  positive  when 

1  In 

k     6  (0,(2-V"2)/3)   and  negative  when 
kl/2  £  ((2V2)/3,  8/7). 

The  value  of  this  derivative  is 


(27)  ^L  k2  +  27k  2  -  72  -  -|(4-3k) 


and,  when   k1'2  €  (0, ( 2-Jl  )  /3  )  ,   this  is 
bounded  below  by 


81  2-£L   +  27_3_ 
2^2 


1 
72  -  |(4-0)2  >  0 


because  -^k1'2  +  27k"1/2   is  easily 
verified  to  be  an  increasing,  and 

-  -^•(4-3'<)  '    a  decreasing,  function  in 
the  interval  in  question.   For 
k1  '2   6  ((2V2)/3,8/7),  ^kl/2  +  27k"l/2 
is  an  increasing  function  and  we  bound 
(27)  by  substituting   k  =  64/49   to 
obtain   -27/8  <  0  . 

A  final  remark:   the  backward  differ- 
entiation method  with  order  3  is   (1,-  y^) 
algebraically  stable  and  it  is  noted  with 
Lnterest  that  the  abscissa   -  yr   exactly 
eea  with  what  is  obtained  in  a  linear 
Pf— stability)  anal ys  i  ,  . 
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A-STABILITY  OF  ONE-  AND 
MULTISTEP  METHODS 

nerhard  WANNER 


Abstract.  This  work  with  S.P.N«$rsett  and 
E.  Hairer  deals  with  a  beautiful  and  reo- 
metric  discussion  of  A-accertability  pro- 
perties of  stability  functions  arising  f~om 
one-  and  multi-sten  methods. 
We  "ive  many  necessary  or  sufficient  con- 
ditions for  A-stability  and  nrove  the  con- 
jecture of  Ehle  (1969)  as  well  as  the  con- 
jecture of  Daniel  and  Moore  (1970). 


Properties  of  Order  Stars. 


Consider  a  rational  stability  function  of  a 
one-step  method 

M.   „,  ,.  P(z)    de^(P)=a  (nb.  of  zeros) 
nvz;"  OTzT   dee(Q)=a.  (nb.  of  poles) 

with  the  order  condition 

,~s      z  Bf    N       n  +  1       n+2^-,  p  +  3\ 

(2)  e  -R(z)=e  ..z-   +  e  ._z    +0(zf  J)  . 

P+i     r+^ 

R  is  said  to  be  A-acceptable  (or  the  method 
is  A-stable)  if|R(z)(<l  on""the  entire  left 
half-plane  C~. 
A  typical  form  of  the  stability  domain 

(3)  D  ={z;  |R(z)|<l) 

is  sketched  in  Fipj.l  (for  the  8-1  Pade 
approximation) . 


■ ::  x : :  t  • 

1 1 1 1  *  t  *. 

1 :  ■ :  •  1 1 1 1 1 

1 : :  I  -. ;  1 

■ntuuit 

1 1  1  t  ■  1  1 

n  ■ 

tnttt'-i 

U„!»! 

'.%%%%%• 

mnutii 

uittit 

•    .•  ■  |*  i'i 

i . :  1  ■  1  - 

1 ;  •  1 1 1 1 1  u 

;iinv. 

1  ■■  1 : 1 1 •  1 1 

' :  1 1 ! :  • 

IIU1II111 

vim* 

■  1  ■.  *  1 1  •  1 1  • 

until 

1  •  *  1  •  ■  1 :  \  ■ 

;  1 1 1 : 1 i 

\  \  •  l  •  1 : 1 : 1 

:uiii: 

*. ;  ■.  \  1 1 J 1  11 

Ittllllttl 

iv.ni; 

UltUUII 

:  1 1 1 : 1 1 
.  1  •  l  s  \  1 

■ 

111  1 1 1 1 1 1 1 

;  1  •- :  •  ■  *.  11 
1  ;■•.••  1 1 1 : 

■  *  1 : '  1  • 
lllt'll 
It It'll 

1 1  •*-.:■  1 1 : 
UU*11tt1 
1 1 1 1 : 1  •  1 1 1 

tilt  Ml 
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■ 

1  ■  -.  *  1 : 

tIMUUII 
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The  form  of  these  stability  domains  is 
clumsy  and  not  very  instructive.  So  the 
first  treatments  on  A-acceptability  have 
been  loaded  with  difficult  computations. 
In  contrary,  the  set 

z 


('))   A  ={z;   R(z) 


} 


for  the  same  rational  approximation  is  sket- 
ched in  Fip;.2  and  pives  much  more  insicht 
(like  a  hand  under  an  X-ray  skreen) . 
At  every  point  where  R(z)  has  a  contact  of 
order  p  with  exp(z),  the  set  A  consists 
of  p+1  regularly  distributed  finders,  so  we 
call  it  the  order  star  of  R. 


Fifr. 


Example  of  an  order  star, 


F1r;.1.  Example  of  stability  domain. 


The  following  properties  are  conseouences 
of  simple  complex  analysis: 

[Lemma  1 .  On  the  imaginary  axis,  A  and  D  are 
complementary.  So  we  can  have  A-acceptability 
only  if  A  does  not  meet  ilR . 

Reason:  Because  of  |e  ^|=1  for  y  real. 

[Lemma  2 .  ^or  z+0,  A  behaves  like  a  star  with 
p+1  "finders"  of  equal  width  w/(p+l),  sepa- 
rated by  p+1  similar  "white  finrers"  of  the 
complement  of  A. 

Reason:  ror  z+0,  z€A  is  equivalent  to 

|R(z)e-z|  =  h  -  e-z(en  +  1zn  +  l+en  +  2z^  +  2+..)|>l 
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or  Re(ep+lZP+1+(ep+2-eD+1)z^+2+..)<0. 

Thus  the  sign  of  the  error  constant  deter- 
mines the  color  of  the  finders. 

fLemma  3.  For  z*«m,  A  tends  to  the  negative 
half-plane  with  exactly  two  boundaries 

leroing  to  infinity. 

*~  ^   z 

Reason:  elementary  properties  of  e  . 

Lemma  4.  Each  bounded  finder  of  A  contains 


at  least  one  pole  of  R;  each  bounded  white 
finger  contains  at  least  one  zero. 
If  several  bounded  fineers  collapse,  then 
they  contain  together  at  least  as  many 
poles  or  zeros  as  there  are  fingers. 

This  lemma  is  less  trivial,  see  flf . 

While  Lemmas  1  and  3  are  linked  to  special 
properties  of  the  exponential  function, 
Lemmas  2  and  4  remain  true  for  any  order 
stars  of  two  functions  R(z)  and  S(z). 

2.  A-acceptability  of  Pade  approximations. 

For  Pade  approximations  we  have  p  =  a+in  and 
Lemmas  2  to  h   do  not  permit  any  other  con- 
figuration as  u  black  fingers  (bounded)  to 
the  right  and  o  white  bounded  fingers  to  the 
left.  Because  of  Lemma  1  and  4,  we  have 
A-acceptability  iff  the  imaginary  axis 
passes  behind  the  last  bounded  black  finger 
(as  indicated  by  dots  in  Fig. 2).  This  is 
equivalent  to 

o<aj<a  +  2  , 
This  includes  the  "conjecture  of  Ehle". 


3.  More  results  on  rational  approximations. 
The  following  generalizes  the  Ehle-conjectur: 
Theorem  5.  If  R  is  A-acceptable ,  p<2a+2. 


Q 


[Theorem   9.    Suppose   that   n>2<o-l    and    |R(»)|<1. 
[Then  R    is   A-acceptable. 

Proof.    The   hypotheses   need   all   poles    in   C    . 

(see    /l/). 

Theorem  10.  Sunpose  p>2w-2,  !R(-)|<1,  and 

the  coefficients  of  Q(z)  have  alternatine 

signs.  Then  R  is  A-acceptable. 

A  similar  version  of  this  has  been  proved 
differently  by  Crouzeix-Ruamps  (1977). 
Theorems  of  Ehle's  Thesis  are  special  cases. 

Theorem  11. Suppose  p2.2w-3,  |R(iy)|<l  for  all 


Proof.  At  least  [(p  +  1)/?]  fingers  of  A 
start  in  C".  These  have  to  be  infinite  and 
enclose  at  least  [(p  +  l)/2]-l  bounded  white 
fingers.  Since  in  total  we  have  a  zeros,  it 
follows  that  [(p  +  l)/2]-l<a.     D 
The  next  theorem  will  below  become  the 
conjecture  of  Daniel  and  Moore: 

[Theorem  6.  If  R  is  A-acceptable,  p<2w. 

4>roof.  At  least  [(p  +  1 ) /2]  fingers  of  A  start 
in  C  .  They  cannot  cross  iff!  and  must  there- 
fore be  bounded.  Since  in  total  we  have  u> 
poles,  it  follows  that  [(p+1) /2]  <u> .  a 

[Theorem  7.  If  R  is  A-acceptable  and  p=2w, 
then  for  the  error  constant  sign(ep+1) = ( -1 ) 
Proof.  Look  at  the  color  of  the  fingers. 

[theorem   8.    If   R    is   A-acceptable   and   p  =  2a), 
"•"   iviNs.)i&iin  ■ 

Proof.  Consider  the  order  star  R  where  R(z) 

is  "compared"  to  the  diagonal  approximation 

R   (z). 

The  following  are  inverse  results  assuring 

A-acceptability: 


y  real,  and  the  coefficients  of  0(z)  have 
alternating  signs.  Then  R  is  A-accentable . 

"Proof.  Similar  as  for  9  or  10.  see  121 . 
Following  are  the  exact  characterizations 
of  rational  approximations  of  order  2u-3 

and  20)-^: 

All  rational  approximations  of  order  2u>-k 

can  be  written  as 


,(i) 


(5)   R(z)  =  Ii:oM(i)(Dzu-1/Ii:oN^i;(0)z- 


where 


N(t)=Pu(t)+A1Pu_1(t)+..+AkPw_k(t), 

P  (t)  =  (^r)<Jj(tu(t-l)aJ)  (see  /*/  ,Prop  .  3)  . 


TO 

[Theorem   12.    R(z)    of    (5)    of   order   2w 
A-acceptable   iff 
i)    1+A2>0,    ii)    Aa+A3<0,    iii)    A3>0. 


3    is 


Theorem   13.    R(z)    of    (5)    of   order   2ui-4 
A.+A,<0 


is 


A-accentable  iff 
i)  1+A  +A,>0,  ii) 


2  "4-  '  x";  "1   3" 
iii)  (2u-5)A1Ai(  +  (2ui-l)A3>0,  iv)  A^O. 

The  proofs  involve  Theorems  9  and  11  above 

and  some  continuity  arguments  of  the  poles 

of  R(z)  as  functions  of  A1  ,A2 ,A^ ,A^ . 

See  /6/  for  details. 

The  following  theorem  gives  an  order  bound 

for  approximations  with  multiple  poles  or 

zeros: 

Theorem  l'l.  Suppose  that  R  possesses  only  a( 


different  zeros. and  uo  different  poles.  Then 


p<a  +w_ . 
o   o 

Because  of  symmetry,  real  but  different  poles 
play  sometimes  the  same  role  as  one  multiple 
pole.  So  this  theorem  is  related  to  the 
highest  attainable  order  with  real  poles. 
See  the  talk  of  S .P .Ndrsett  for  details. 

>\ .  Extension  to  multi-step  methods. 

Eor  a  k-step  method  the  rational  function 
(1)  is  replaced  by  a  characteristic  polynom- 

(1')  0(z,R)=Qo(z)Rk+Q1(z)Rk"1  +  ..+Qk(z)=0 

0  (0)*0,  deg(Qk)=a,  deg(Q0)=w. 

R(z)  becomes  an  algebraic  function  to  be  con- 
sidered on  its  Riemann  surface  M.  For  linear 
multistep  methods  we  have  a<l ,  w<l  .  For 
other  classes  of  methods  o  and  u  are  related 
to  the  number  of  involved  derivatives, 
stages  etc. 
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-  the  fact  that  R..=l  is  a  sinrle  zero  of  References . 

Q(0,R)  and  can  thus  ne  analytically  extended  ,«,„.,      ,-  ,,  .     „  „  .,,       „    ^ 

in  a  neighbourhood  of  zero  to  a  holomorphic  /U  V 'anner  >F  •  "*lr*r**  •£.«*«•«  Order 
function  R^z),  the  order  condition  can  now  mqS!  w-a-^oo  V  theorems'  RTT  1* 
he  *rit1  •   -  (1978),  Wtf? 

z   R  #_x  _      p+1  +  nf7n+21  /2/  H. Wanner, E.Hairer.R. P. Nrfrsett,  When 
n\KZ>     ■    ep  +  lz            ;'  I-stabillty  implies  A-stability, 

The  above  Lemma  2  now  carries  over  to  the  PI"'  18  (1978),  p  5a3 

corresponding  sheet  of  the  Riemann  surface,  ,,.  _  „  .„    ,,    *,^t         i  •■      .  -,     tl_  . 

when  we  now  consider  the  order  star  as  a  /V   -"ai  er   Hncondl   onallv  stable  methods 

subset  of  M.  Also  for  Lemmas  1,3,  and  U  ^"^^'r  ^/Th"  \l  Le°"at  l0nS  ' 

.  ,        ,       j     mw  to  anpear  in  Numer.  Math.  ,1979 

exist  natural  extensions.  Thus:  ''  7 ' 7 

^v,^„o~c  £  7  P   or,^  11  r^^moir,  ^r,..^  i  „  fv,.  / '' /  S . P . Fdr se t t , 0 . Wanner ,  The  real-nole 
Theorems  o,7,c,  and  11  remain  true  in  the  _  ...  .  -  '    . .    '       . 

multl-ster  case sandwich  for  rational  approximations  and 


oscillation  enuations,  appears  in  PTT 


Theorems  6  and  8  have  been  known  as  the  1979. 

"conjecture  of  Daniel  and  Moore",  the  special  ....  „  _  „.    ..  _  ,.       n 

case  u  =  l  is  Dahlouisfs  classical  Theorem  /5/  S.P.^rsettO. Wanner   Perturbed  collo- 

that  p  =  2  is  the  highest  attainable  order  of        cation  and  Runre-Kutta  methods,  submitted 

an  A-stable  linear  multistep  method.  zo     lumer-   atn- 

Theorem  9  needs  the  following  modification:  /6/  Y .Burrapre ,C .Wanner ,  Characterization  of 


Theorem  9'.  Suppose  that  p>2u>-l  and  _       _   ,    .  _ 

R(ly)!<l  for  all  y€R.  Then  R  is  A-accenta-        °f  °rder  2s"3  and  2*~k >    to  he  written. 


jble 

La  extends  the  result  of  Jeltsch  (PIT 
1978)  to  u>l. 

Theorems  5  and  1*J  do  not  extend  as  they  are 
to  the  multistep  case,  since  the  Proofs  use 
the  fact  that  C  Is  simply  connected,  which 
is  no  longer  true  for  arbitrary  Riemann 
surfaces . 

5.  Extension  to  second  order  enuations. 

If  a  stability  test  for  methods  for 

y"=f(t,y) 

is  made  at  y"+By=0  with  P real,  the 
characteristic  polynomial  becomes 

(1")   Q(z,R)=Qo(z2)Rk+..+Qk(z2)=0, 

where  now  R=l  becomes  a  double  zero  of 
Q(0,P)=0.  But  since  there  appears  z2  in 
(1")  only,  there  is  no  monodromy  around 
this  point  on  the  corresponding  Riemann 
sheets  and  these  two  zeros  continue  as  holo- 
morohlc  functions  in  the  neighbourhood  of 
the  origin.  One  of  these  two  solutions  then 
satisfies  the  order  condition  (2') 
and  there  is  a^ain  a  beautiful  star  on 
the  corresponding  sheet  of  the  Riemann  sur- 
face. So  Theorems  6, 7, 8, and  11  remain  true 
also  in  this  case,  The  special  case  u=l  was 
proved  by  Dahlquist  (BIT  1978).  Theorem  6 
has  the  following  corollary: 

Corollary  6 ' .  If  the  solution  of  0(z,R)=0 

Tsee  Lambert -V/at  son  ,  J.Inst. 
Math. Applies.  1976:  this  is  eouivalent  with 
the  fact  that  all  roots  of  0(iy,R)=0,  y  real, 
satisfy  |R|=1),  then  p<2u. 

See  Hairer  /3/  for  details.  Hairer  has  con- 
structed unconditionally  stable  methods  of 
higher  order  for  this  type  of  enuations. 


A-stable  and  P-stable  Runee-Kutta  methods 
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ORDER    AND    STABILITY    OF    COLLOCATION    METHODS    WITH    REAL    EIGENVALUES. 


S.P.     N0RSETT 

INSTITUTE    OF    NUMERICAL    MATHEMATICS 

UNIVERSITY    OF    TRONDHEIM,     TRONDHEIM-NTH 


The  collocation  method  for  obtaining 
an  approximate  solution  to  the  initial 
value  problem 

(1)    y'=f(t,y),  t€(a,b),  y(a)=yQ  , 

amounts  to  finding  a  polynomial  u  of 
degree  at  most  m  such  that 

i)     u(t0)  =  y0 

ii)     u'(t  +  c.  h)  =  f  (t  +c.  h,u(t  +c.  h))  , 

0    l  0    i    '      0    i 

i  =  1  ,' • • ,m  , 
iii)    yl     =    u(tj  =  tQ+h) 

where  c.  ,  i  =  1,««*,m  are  m  distinct 
real  numbers.   By  using  a  result  of 
Alekseev  and  Grobner  on  the  nonlinear 
Variation-of-constants  formula  for  (1) 
a  very  short  and  nice  proof  can  be  given 
for  the  already  known  result   that  the 
collocation  method  possesses  the  same 
order  as  the  corresponding  quadrature 
formulae . 

When  the  collocation  scheme  is 
applied  to  the  test  equation  y'  =  Ay, 
z  =  Ah,  we  get 

•ji     =  R(z)yQ  =  (P(z)/Q(z))y0 


where 


(  m) 


P(z)  =  N(1)z  +---+!r  '  (1  )  , 


/.)  =  M(0)zm+---+!I(m)  (0) 

and  N(t)  =  (t-c.  )  • • • (t-c  )/m!.   R(z)  is 
an  approximation  to  exp(z)  of  order  s>m 
method  has  order   .   (N  is  called 


the  N-polynomial  to  P.  and  is  related  to 
the  C-polynomial  p  of  N0rsett  by  N(t)  = 
(-1  )mp(1-t).  ) 

It  is  well  known  that  the  collocation 
method  is  equivalent  to  a  special  subclass 
of  Runge-Kutta  methods  with  RK-matrix 


{au  =  /£.  (tMt} 


where 


i  ,  i  =  1 


I.    (l) 
J 


n  (t-c,  )/  ji  (c.  -c,  ) 

k  i    k 

k*j  k*j 


By  writing  Q(z)  =  ( 1 -y, z ) • • • ( 1 -y   z)  we 
have  that  y    ,  ••*,Y   are  the  eigenvalues  of 

1  w 

A  and  that  if  y.     is  a  simple  eigenvalue  of 
A,  a  corresponding  eigenvector  x.  of  A  is 


(x 


H' 


'   nj 


I  J 


=  N'  (c.  )y.  +• 


.,(  m)  ,    .   m 

+  N    (c,  )y. 


These  results  are,  as  pointed  out  by 
Butcher,  essential  in  connection  with  the 
practical  implementations  of  implicit  RK- 
methods.   In  particular  we  are  interested 
in  RK-methods  where  the  eigenvalues  are 
real  and  hopefully  equal.   Two  questions 
arise  :   first  which  collocation  points 
do  we  need  in  such  a  case  and  secondly 
what  is  the  maximum  order  of  the  corre- 
sponding method. 

An  upper  bound  on  the  order  can  be 
given  by  considering  the  order  of  rational 
approximations  R(z)  to  exp(z)  where  R  has 
real  poles.   For  example,  we  can  show  by 
using  the  theory  of  order   stars   developed 
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by   V.  -.he 

following   interesting   result    : 

Let  =Pk(  ),    Pk  €TTk  ,    Q.    €  Tij 


.as  r  <  j  complex  different 
sn  the  order  s  satisfies 


s  < 


if  Q  has  no  real  zero 


k+r+1  if  J   has  at  least  one 
real  zero  . 


This  result  was  earlier  obtained  for 
the  case  r  =  0  by  N0rsett  &  Wolfbrandt  in 
a  rather  more  technical  way.   The  proof 
is  now  short  and  is  a  counting  of  bounded 
dual  fingers  versus  unbounded  dual 
fingers . 

The  A-stability  of  these  methods,  and 
also  of  all  one-step  methods  giving 
y,  =  R(z)y   when  applied  to  the  test 
equation  y1  =  Ay,  reduces  to  the  question 
of  A-acceptability  of  R(z)  =  Pfc(z)/Q. (z), 
i.e.  |R(z)|  <  1  for  z  £  C".  For  the 
optimal  order  approximation,  s  =  k+j ,  R 
is  an  entry  in  the  Pade-table.   By  intro- 
ducing the  concept  of  order   stars    (see 
the  talk  by  G.  Wanner)  one  easily  proves 
that  the  diagonal  and  the  first  two  sub- 
diagonal  entries  are  the  only   A- 
acceptable  approximations. 

When  Q  has  only  real  zeros  y.  =  Y> 
I  =  1,«»«tk,  k  =  j,  the  approximations 
have  the  form  (of  order  at  least  k) 


R(z 


)  =  Q  (-1  )mL(k-m)  (1/Y)(Yz)m)  /  (1-yz)"  , 


approximations  is  as  the  following  table 
shows 


k 

12       3       5 

V 

1112 

When  y  is  a  solution  to  L  (1/y)  =  0, 
Q   £  v  and  L-acceptable  approximations 

mav  appear.   Some  results  in  that  direc- 
tion  will  be  discussed  as  well  as  the 
I-acceptability  of  the  special  approxi- 
mations 


R(z) 


P  „  (z)/(1-yV  )k 

2  k 


suitable  in  connection  with  integrating 
discretized  hyperbolic  equations.   Some 
of  the  questions  rised  by  Baker  and 
Bramble  who  first  introduced  these 
functions  will  be  dealt  with. 

For  further  details  see  the  following 
papers , 

|1|   G.  Wanner,  E.  Hairer,  S.P.  N^rsett; 

"Order  stars  and  stability  theorems", 

BIT  1 8  (1978),  475-489. 
|2|   S.P.  N^rsett,  G.  Wanner; 

"The  real-pole  sandwich  for  rational 

approximations  and  oscillation 

equations" . 

To  appear  BIT. 
|3|   S.P.  N0rsett,  G.  Wanner; 

"Perturbed  collocation  and  Runge- 

Kutta  methods". 

Submitted  to  Num.  math. 


y  e  R  . 

Optimal  order  k+1  is  obtained  for  y  a 
solution  to  L!   (1/y)  =  0.   Let  the  k 
different  Y-values  be  denoted  by  y    with, 
0  <  1  /y      <  1/y   < <  1/Y...   for  each 

1  1         '2  k 

of  these  Y-values  we  obtain  the 
restricted-Pade-approximations ,  and  an 
interesting  property  is  that  the  order 
star  corresponding  to  y  =  Y   contains 
just  one   bounded  finger  of  multiplicity 
k-v+1.   Based  on  that  result  the  A- 
acceptability  of  the  restricted-Pade- 
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SAFE  POINT  METHODS  FOR  SEPARABLY  STIFF  SYSTEMS 


J  D  Lambert 
University  of  Dundee 


§1   Introduction 

The  solution  of  the  2*2  linear  constant 
coefficient  system  y'  =  Ay  ,  y(x0)  =  yn  ,  where 

A  has  real  eigenvalues  X.  <<  X~  <  0  and   eigen- 
vectors CpCj  is  y(x)  =  k.  (exp(X.x))c.  + 
K2(exp(X2x)c2  represented  in  the  following 
diagram. 


X,  real,  *,  «  t=2^.,mVt  <  ° 


(1.1) 


P1[x,(K1+E])(exp(X]x))c]  +  (K2+e2)(exp(X2x))c2] 

is  a  numerical  approximation  to  the  exact  solution 
point  P  .  Even  if  Pf   is  close  to  P  ,  the  high 

gradient  at  Pj   invokes  instability  if  an  explicit 

method  is  used.   Correcting  P.   in  the  c.- 

direction  was  proposed  by  Lambert  flj  and  developed 
by  Alfeld  and  Lambert  [2]  into  the  CDS  technique 
(Correction  in  the  Dominant  Space). 

The  new  proposal  is  to  correct  P.   in  the 

direction  of  the  tangent  to  the  local  integral 
curve  through  Pj  .  For  all  Kj  ,  this  tangent 

passes  through  S[x-I /Xj , (<2+e2) (I-A-/X. ) 

(exp(X2x))c_]  ,  which  we  call  the  safe  point,  since 

it  invokes  no  high  gradients. 

Consider  the  m  *  m  linear  system 
y'  ■  Ay  +  g(x)  ,  where  A  has  eigenvectors   {cf} 
and  eigenvalues   {X  }   such  that 


If  (1.1)  is  satisfied  we  say  that  the  system, 
and  the  matrix  A  ,  are  simply  separably  stiff.   The 
exact  solution  at  x  =  x  is  represented  by  the 
point  P[x",k.  (exp(X.x)c.+^(x)  ]  ,  and  the  numerical 

solution  by  P.  Cx,  (<•  +e. )  (exp(X  .x)  )c.+tKx)+<5  (x)  ]  . 

The  tangent  at  P.   to  the  local  integral  curve  now 

contains  the  safe  point  S[x-1 /X. ,ij)(x)+6(x)- 

0J>' (x)+6' (x))/X.  J  ,  which,  note,  no  longer  lies  in 

the  subdominant  space.   Since   1/X.   is  small  we 

could  accept  the  solution  at  S  as  an  approximation 
to  y(x)  ;  this  invokes  a  persistent  error  (cf  CDS) 
of  n'(X)/X,  .   More  advantageously,  we  can  accept 

the  solution  at  S  as  an  approximation  to 
y(x-l/X.)  ,  which  invokes  a  persistent  error  of 

n"(x)/xf  . 

§2  One-step  Safe  Point  Methods 

Problem 

y'  =  A(x)y  +  g(x)  =:f(x,y)  ,  y(xQ)  =  yQ  , 
y,g  £  K  ,  eigenvalues   {X  (x)}   satisfying  (1.1) 
V  x  e  [x0,xN]  . 

Procedure 

(1)  Starting  from  x   ,  take  forward  step,  length 
nn  i  by  conventional  explicit  method,  the  basic 
method. 

(2)  Compute  X.   by  the  power  method.   (Note 

(i)  the  stiff er  the  better!  (ii)  estimates  for 
initial  iterate  available  from  previous  step; 
(iii)  eigenvectors  not  needed  (cf  CDS).) 


(3)  Proceed  to  safe  point,  and  accept  as  an  approxi- 
mi 

I 


mation  to  y(x  +h  +5  .,)  ,  where 


'n+1 


■1A,(VV 


This  gives  an  irregular  grid  of  data  points,  hence 
the  motivation  for  one-step  basic  methods,  which 
will  yield  numerical  solutions  on  the  grid 

(xJ*n-X0+jWV.)}  • 
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Algorithm  (SPM)  , 

z   ,  ■  v  +  h  £,(x  ,v  ;h  ) 
n+1    n    n  f  n'-'n*  n 


'*l  =  V.  +  W(Wzn+l 


(i) 

(ii) 

)(iii) 


(2.1) 


Local  TE 

p  the  order  of  (i) 


0(hP+1)  ♦  0(d+1)  , 
n         n+ 1 


Note:   It  is  of  course  possible  to  start  with  the 
correction  step  (iii);  this  is  equivalent  to 
applying  (2.1)  to  a  perturbed  form  of  the  problem. 

§3  Other  Safe  Points 

Regard  (2.1  (iii))  as  an  Euler  step  with  £ 

as  discretization  parameter  (cf  singular  perturba- 
tion?) .   We  could  replace  this  by  RK   (any 

q-stage  explicit  Runge-Kutta  method  of  order  q  , 
q  S  4)  .  Clearly,  from  Fig  1,  S  can  no  longer  be 
the  safe  point.   It  turns  out  that  RK.  has  no 


safe  point,  but 


Y/X 


RK..  has  a  safe  point  given  by 
■1.596071638  .  Using  this  in  place 


of  (ii)  and  replacing  (iii)  by  an  RK..  method, 

3 
(2.1)  yields  a  method,  which  we  label   (SPM),  , 

with  local  TE  =  0(hp+  )  +  0(£  .)  ;  the  persistent 

error  can  now  be  safely  ignored,  even  for  systems 
of  modest  stiffness.   In  the  sequel  we  consider 
only  correction  steps  of  the  form  given  in  (2.1), 
but  in  every  case  replacement  by  a  correction  based 
on  RK,  is  possible. 

§4   Stability  and  Sensitivity  of  (SPM) . 

Definition 

Let  A  be  simply  separably  stiff  and  have 
distinct  eigenvalues  {X  | t  =1,2,  . . . ,  m}  .  Then 

a  SPM  is  said  to  be  dominantly  stable  if  all  solu- 
tions of  the  difference  equation  in   {y  }  which 

result  from  applying  the  method  to  y1  =  Ay  tend 


to  zero  as  n  ■*  °°  for  all  step  lengths  h 


such 


that 


Vt  e*B  ■  C  =  2«  3' 


,  m  ,  where  J£  B  is 

the  region  of  absolute  stability  of  the  basic 
method. 

Theorem 

3 
(SPM) |   and   (SPM).   are  dominantly  stable. 

Sensitivity 

Applying  (2.1(i))  to  y1  =  Ay  yields 

z   ,  ■  P(h  A)y   ,  where  P(-)   is  a  polynomial  of 
n+ 1      n   n 


degree  p 


Dominant  stability  depends  on  the 

0  when   t  =  1  .   If 


fact  that   (l+£A  )P(h  X  ) 

~      n  ' 
we  denote  by  X.   the  value  of  X.   computed  by 

the  power  method,  then  stability  is  retained  only 

if   |(l-At/A  )P(h  X  )|  <  1  ,  which  leads  to  the 

,?     ,  ..  ,r    ,.  .  .  -d 

an  excessive 


T 


n  1 

,-x,: 


requirement   (X.-X.)/A  ~  (hX.)" 


precision.   Analogous  difficulties  beset  CDS  when 
the  basic  method  is  one-step. 

§5  Linear  Multistep  SPM 

In  order  to  obtain  a  solution  on  a  regular 
grid,  we  propose  to  correct  by  one  forward  (Euler) 
step  to  the  safe  point  and  then  back  by  a  further 
(Euler)  step.   This  process  retains  the  same  safe 
point  (eg  Fig  1).   We  now  use  as  basic  method  the 
K-step  explicit  LMM   (p*,cr*)  . 


Algorithm  (SPM). 


zn+k  =  (Ek-p*(E))yn+ha*(E)f(xn,yn)    (i) 


'n+k 


■!A,(xn+k) 


(ii) 

Vk  =    Zn+k  +   W^WW  (iii) 

W  =  ?n+k   "  W(Xn+k+WVk}    (iv) 


•  (5.1) 


Local  TE  =  0(hp+l)  +  0(£2  .  ) 

n+k 
3 
(SPM),   is  similarly  defined,  replacing  (iii)  and 

(iv)  by  RK3   steps.   (SPM),   is  absolutely  stable 

if  the  polynomial 

7r(r;hAt,£At):=  rk  -Qt[rk-p*(r)+hXta*(r)  ]   (5.2) 

is  Schur  for   t  =  1 ,  2,  ...,  m  ,  where 

2 
Qt  =  1  -  (£Xt)   .  This  is  clearly  so  for  t  =  1  , 

while  for  t  =  2,  ....  m  ,  it  =  p*  -  hA  a*  +  0(X  /Xj  )2 

and  we  expect  a  stability  region  little  different 
from  that  of   (p*,CT*)  . 

For   (p*,a*)  =  4th  order  Adams-Bashf orth  (5.2) 
is  Schur  in  the  following  region  (where  we  assume 


I 


I' 


It  follows  that  for  t  =  2,  . . . ,  m  the  interval  of 

absolute  stability  is  only  slightly  less  than  that 

of  AB,  .   Let   t  =  1   in  diagram,  and  for  Q, 

~  2 
read   1  -  (X./X.)   .   Then  stability  is  retained  if 

(QphX.Q.)   lies  in  the  indicated  parallelogram, 

which  implies  the  much  improved  sensitivity 
requirement  h|X.-X.|  <  .077  . 

§6  Predictor-corrector  SPM 

A  predictor   (p*,a*)  and  corrector   (p,a)   in 
PECE  mode  usually  has  better  absolute  stability 
than   (p*,0  )  .   Applying  the  correction  process 
(5.1 (iii) (iv))  after  the  PECE  cycle  -  indicated  by 
PECE£  -  raises  the  sensitivity  problems  encountered 
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with   (SPM),  .   However,  for  the  mode  VE^CEE,    ,  (5.2)  is 
(5.2)  is  replaced  by 

tt  =  (l-Qt)(l+hXt6kQt)rk  + 

Qt[p-hXta+hXtBkQt(p*-hXta*)]  . 

An  order  of  magnitude  argument  suggests  a  good  com- 
promise between  sensitivity  and  subdominant  stabi- 
lity is  got  by  setting  to  zero  terms  in  hX  Q   , 

which  forces  the  choice  a     =  B.r   ,  ie   (p,a)   is 

a  backward  differentiation  formula  (BDF) .   Further, 
it  is  found  to  be  advantageous  to  choose  p   =  p  . 
The  PL  pair  is  then  completely  specified  by 

(p,a)  =  BDF  of  order  k  ; 

p*  =  p  ;   (p*,cr*)  of  order  k  . 

§7  Extensions 

The  extension  to  the  general  separably  stiff 
linear  problem  (ie  more  than  one  dominant  eigen- 
value) is  straightforward;  but  computing  estimates 
for  the  dominant  eigenvalues  becomes  more  difficult. 
Nonlinear  systems  may  also  be  tackled  by  evaluating 
X.  =  X.(x,y)  ,  dominant  eigenvalue  of  the  Jacobian, 

at  appropriate  approximate  values  for  y  .   Success 
will  depend  on  the  eigensystem  of  the  Jacobian 
varying  slowly  with  x  . 
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Many  problems  in  theoretical  chemistry  re- 
quire an  understanding  at  the  microscopic  level 
of  molecular  scattering.   That  is,  an  understand- 
ing of  what  internal  and  external  changes  take 
place  as  two  molecules  approach  one  another,  col- 
lide and  then  recede  from  one  another.   From  this 
knowledge  at  the  microscopic  level  one  can  then 
calculate  many  properties  of  macroscopic  chemical 
systems  which  cannot  be  measured  presently  in  the 
laboratory  nor  observed  directly  in  our  environ- 
ment. 

The  applications  of  molecular  scattering 
information  are  many  and  far-reaching.   In  laser 
physics  this  information  is  needed  in  the  search 
for  new  chemical  systems  with  better  lasing  pro- 
perties and  for  performance  improvement  of  exist- 
ing ones.   The  discovery  within  the  past  ten 
years  of  a  rich  variety  of  molecules  and  mole- 
cular clouds  in  interstellar  space  has  created  a 
need  in  astrophysics  for  molecular  scattering 
data.   Many  interstellar  molecules  cannot  be  made 
in  earth-bound  laboratories,  leaving  computation 
as  the  only  method  for  the  determination  of 
their  properties.   The  study  of  our  own  atmos- 
phere involves  many  molecular  scattering  pro- 
cesses, including,  for  example,  the  formation 
and  destruction  of  ions  in  the  upper  atmosphere 
and  the  creation  of  air  pollutants.   Many  energy- 
related  problems  also  require  understanding  of 
molecular  scattering  events.   To  understand 
combustion  processes  requires  knowledge  of  the 
associated  reactive  scattering  events  and  in 
nuclear  reactors  there  are  many  inelastic  and 
reactive  collisions  involved  in  the  final  dis- 
position of  the  generated  energy.   This  list- 
ing is  just  a  brief  mention  of  a  few  of  the 
applications. 

Quantum  mechanical  approaches  to  mole- 
cular scattering  can  be  reduced  to  the  solu- 
tion of  a  system  of  second-order,  ordinary 
differential  equations  of  the  form, 
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y"(r)  +  k  y(r)  =  V(r)y(r) 

k2>0 

y(0)  =  0 

lim  y(r)  =  sin(kr+n)  +  cos(kr+n)R 


n 


constant 


(1) 
(2) 
(3) 

(4) 
(5) 


Here  V(r)  is  called  the  potential  energy  function 
or  most  often  simply  the  potential.   Another  impor- 
tant class  of  equations  have  non-local  potentials, 
i.e.  , 


y"(r)  +  k2y(r)  =/   V(r  ,r  '  )y  (  ')dr  • 


(6) 


However,  the  discussion  here  will  be  limited  to 
the  local  case.   Eq.  (1)  will  generally  be  a 
matrix  equation  with  V(r)  a  real,  symmetric  matrix 
and  k  diagonal.   In  this  case  the  sin  and  cos 
functions  in  Eq.  (4)  are  to  be  regarded  as  diag- 
onal matrices  and  R  is  real  symmetric.   The  matrix 
R  is  the  desired  quantity.   From  it  all  micro-  and 
macroscopic  scattering  information  can  be  derived. 

This  problem  is  most  commonly  solved  by  start- 
ing with  Eq.  (3),  assuming  an  arbitrary,  non- 
singular  matrix  for  y'(0)  and  then  numerically 
integrating  outward  to  some  large  r  such  that 
V(r)=0.   the  numerical  solution  then  has  the  form 


y(r)  =  sin(kr+n)A  +  cos(kr+n)B 


(7) 


and  the  desired  solution  is  found  by  multiplying 
from  the  right  by  A-1.   That  is 


R  =  BA 


(8) 


The  single  equation  problem  illustrated  in 
the  figure  below  is  representative  of  the  nature 
of  the  problem.   For  small  r  where  V(r)>k2,  the 
solution  has  exponential  behavior.   For  large 
r  where  V(r)<<k2  the  solution  is  oscillatory. 
Various  methods  have  been  devised  to  compute 
these  functions,  some  of  which  try  to  take  advan- 
tage of  their  known  limiting  behavior.   A  brief 
survey  of  these  methods  with  representative 
references  follows. 
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With  the  exception  of  Gordon's  method3 
computer  programs  for  these  methods  are  not  gene- 
rally available  to  or  usable  by  people  other  than 
the  developers.   At  the  National  Resource  for 
Computation  in  Chemistry  (NRCC),  we  are  now 
attempting  to  bring  together,  test  and  make  avail- 
able the  best  of  these  programs.   At  this  meeting, 
we  hope  to  stimulate  a  broader  dialogue  on  such 
problems  between  theoretical  chemists  and  the 
community  of  numerical  mathematicians. 
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The  Numerov1  and  de  Vogelaere2  algorithms  are 
straight  forward  and  have  found  wide  application. 
The  method  of  Gordon3  which  makes  a  piecewise 
linear  approximation  to  V(r)  is  especially  advan- 
tageous in  the  large  r  region.   The  variable  phase 
method1*  represents  y  as  a  product  of  an  oscillator 
y  function  and  a  modulating  function  which  is 
smooth  and  slowly  varying  over  a  large  part  of  the 
integrating  range.   Several  methods  '   have  also 
been  developed  for  integrating  the  integral  equa- 
tions equivalent  to  Eq.  (1).   In  addition  a  num- 
ber of  unconventional  methods  have  recently  been 
developed.   These  include  the  L-  methods7  (the 
solutions  y  are  expanded  in  a  basis  of  square 
integrable  functions) ,  the  R-matrix  propagator 
method  and  an  iterative  method  for  finding  a 
single  column  of  the  solution  matrix  y.9   It  is 
also  possible  to  formulate  the  problem  as  a 
smaller  system  of  linear,  second-order  partial 
differential  equations  and  finite  element  methods 
for  solving  these  have  recently  been  proposed. 

Because  of  the  matrix  multiplication  in  Eq. 
(1),  the  computing  time  required  for  most  of  these 
methods  increases  as  N3  where  N  is  the  order  of 
the  matrix.   The  maximum  cases  solved  to  date 
typically  involve  N-100.   The  greatest  need  at  the 
moment  in  molecular  scattering,  is  for  methods 
which  allow  a  significant  increase  in  N. 

Other  important  needs  for  improved  algorithms 
.)  faster,  more  efficient  programs,  2)  error 
estimates  with  variable  and  automatic  step  size 
sel'        md  3)  programs  which  are  generally 
ible  and  easy  to  use. 
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INTRODUCTION.   Historically,  the  development  and 
analysis  of  methods  for  ordinary  differential  equa- 
tions (ODEs)  have  been  more  advanced  than  those  for 
partial  differential  equations  (PDEs).   The  present 
state  of  numerical  methods  is  no  exception;  there- 
fore, it  behooves  the  numerical  analyst  to  exploit 
sophisticated  ODE  methods  for  the  numerical  solution 
of  PDEs. 

The  method  of  lines,  for  example,  capitalizes 
on  reliable  ODE  packages  to  solve  PDEs.   However,  as 
the  number  of  spatial  dimensions  increases,  the 
direct  application  of  the  method  of  lines  produces 
excessively  large  systems  of  ODEs.   As  an  illustra- 
tion, consider  the  two-dimensional  hyperbolic  system 


(1) 


_3u    3F(u)    3G(u) 
3t     3x      3y 


where  u  is  an  m-component  vector.   For  the  equa- 
tions of  gas  dynamics  m  =  4  and  if  the  spatial 
derivatives  in  (1)  are  replaced  by  spatial  differ- 
ences and  the  computational  mesh  is  chosen  with 
50  grid  points  in  each  spatial  direction,  the  end 
result  is  a  system  of  10  ODEs.   While  this  system 
might  not  overwhelm  the  more  robust  ODE  packages, 
the  three-dimensional  counterpart  for  practical 
physical  problems  would  produce  a  system  of  order 
10  stiff  nonlinear  ODEs  —  a  formidable  challenge 
for  any  ODE  package.   The  stiff  character  of  the 
system  implies  that  an  efficient  solution  algorithm 
must  be  implicit,  hence  we  limit  our  discussion  to 
implicit  methods.   Clearly,  the  direct  application 
of  the  method  of  lines  to  multidimensional  PDEs  does 
not  take  advantage  of  the  sparse  structure  of  the 
ODE  system  —  unless  the  ODE  package  is  appropriately 
designed  [7]. 

In  the  development  of  implicit  numerical  meth- 
ods for  PDEs,  the  greatest  improvement  in  numerical 
efficiency  was  the  introduction  of  alternating 
direction  implicit  (ADI)  methods  [5,6,10],  which 
reduce  multidimensional  computational  problems  to  a 
sequence  of  simpler  one-dimensional  problems.   In 
the  context  of  the  method  of  lines,  ADI  methods  are 
analogous  to  an  approximate  factorization  of  the 
large  matrix  which  must  be  "inverted"  by  the  stiff 
ODE  package.   Although  in  principle  the  approximate 
factorization  could  be  designed  into  the  ODE  pack- 
age, in  practice  it  would  be  difficult  without  an 
a  priori  knowledge  of  the  PDE  system.   A  more  effi- 
cient technique  is  to  discard  the  direct  application 
of  the  method  of  lines  and  introduce  the  ODE  methods 
prior  to  the  spatial  differencing. 

The  purpose  of  this  paper  is  to  demonstrate 
that  the  methods  and  analyses  from  the  numerical  ODE 
development  can  be  directly  applied  to  the  develop- 


ment of  efficient  PDE  algorithms.  We  choose  the 
linear  multistep  methods  (LMMs)  and  focus  on  those 
which  are  A-stable.   As  a  vehicle  for  the  demonstra- 
tion we  outline  the  development  of  an  A-stable  ADI- 
LMM  for  a  mixed  hyperbolic-parabolic  system.   We  do 
not  propose  the  direct  application  of  existing  ODE 
packages;  however,  it  is  hoped  that  many  features 
of  these  packages  (variable  time  step,  error  con- 
trol, etc.)  can  be  implemented  in  PDE  packages. 

PRELIMINARIES.   Consider  a  nonlinear  mixed 
hyperbolic-parabolic  system  of  the  form 

3u   3F(u)   3G(u) _  av(u,ux»"y)   3W(u,ux,uy) 
{Z)    3t    3x     3y        3x     +     3y 

where  u  is  an  unknown  m-component  vector,  and  F 
and  G  are  given  (in  general,  nonlinear)  vector 
functions  of  u.   The  vectors  V  and  W  are  vector 
functions  not  only  of  the  components  of  u  but 
also  of   3u/3x  and  3u/3y. 

Physical  problems  associated  with  equations  of 
form  (2)  (e.g.,  the  compressible  Navier-Stokes 
equations)  are  typically  stiff  by  virtue  of  dispa- 
rate characteristic  speeds  and/or  length  scales.  In 
addition,  the  parabolic  terms  on  the  right-hand 
side  of  (2)  are  often  negligible  except  in  thin 
layers  of  the  solution  domain.   As  a  result,  the 
eigenvalues  associated  with  a  locally  linearized 
version  of  (2)  are  widely  distributed  with  large 
imaginary  parts.   Consequently,  A-stable  integra- 
tion formulas  provide  ideal  time-differencing 
approximations . 

As  is  well  known,  the  order  of  accuracy  of  an 
A-stable  linear  multistep  method  cannot  exceed  two 
[2].   At  present,  the  application  of  higher  order 
A-stable  methods  such  as  multistep  multiderivative 
methods  appears  to  be  impractical  for  PDEs  as  com- 
plex as  (2).   However,  there  exists  a  large  class 
of  physical  problems  where  second-order  temporal 
accuracy  suffices.   For  the  equations  of  gas  dynam- 
ics at  high  Reynolds  number,  solutions  of  (2) 
exhibit  large  spatial  gradients  in  localized 
regions  such  as  shock  waves,  shear  layers,  boundary 
layers,  etc.   In  addition,  the  fastest  characteris- 
tic speeds  are  often  of  only  marginal  importance. 
For  such  problems,  numerical  schemes  which  combine 
fourth-order  spatial  accuracy  with  second-order 
temporal  accuracy  are  often  considered  "optimal"  in 
terms  of  both  storage  requirements  and  computa- 
tional effort  (work).   One  must  also  keep  in  mind 
that  (2)  is  an  initial  boundary  value  problem; 
consequently,  it  is  very  difficult  to  supply  the 
proper  boundary  conditions  (for  a  well-posed 
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problem)  at  the  advanced  time  level  when  higher  order 
temporal  methods  are  used. 

We  have  recently  shown  [13]  that  A-stable  LMMs 
combined  with  approximate  factorization  provide  a 
natural  framework  for  the  construction  of  uncondi- 
tionally stable  ADI  methods  for  PDEs.   The  main 
emphasis  of  [13]  was  linear  stability  theory  and  we 
considered  only  model  equations.   In  the  following 
section  we  outline  the  extension  of  [13]  to  non- 
linear PDEs. 

ALGORITHM  DEVELOPMENT.   A  linear  k-step  method  for 
(3)  jr  =   f (u)  ,    u(0)  =  u0 


can  be  written  as 


(4) 
where 


p(E)u  =  Ato(E)f 


k  k 

(5)  p(0  =  £  ai^  .   o(C)  =  £  6i?J  . 

1-0  l=o 

E  is  the  shift  operator,  Eu11  =  un+1,  and  At  is 
the  time  increment.  Henceforth,  we  consider  only 
A-stable  LMMs;  i.e.,  we  assume  (4)  is  A-stable. 

In  the  procedure  suggested  in  [13]  for  con- 
structing ADI  schemes,  the  implicit  operator  to  be 
"inverted"  is  constructed  so  that  the  unknown  vari- 
able to  be  determined  at  each  time  step  is  p(E)un 
rather  than  u   .   Hence,  as  a  precursor  to  formu- 
lating an  ADI  scheme,  we  rewrite  (4)  as 

(6)  p(E)un  -  wAtp(E)fn  =  At [o(E)  -  cop(E)]fn 

where   w  =  S^/a^.   The  parameter   oj   is  defined  so 
that  the  operator  o(E)  -  up(E)  on  the  right-hand 
side  of  (6)  is  at  least  one  degree  lower  than  the 
operator  p(E)  on  the  left-hand  side.   Hence  the 
right-hand  side  of  (6)  can  be  computed  explicitly, 
i.e.,  from  known  data  when  advancing  the  numerical 
solution  from  n  +  k-1  to  n  +  k.   The  restriction 
to  second-order  temporal  accuracy  allows  several 
approximations  to  be  made  which  greatly  simplify  the 
final  algorithm.   We  illustrate  these  approximations 
for  the  LMM  (6). 

(a)  Local  linearization.  Since  (6)  is  a  nonlinear 
system  of  difference  equations,  an  iterative  solu- 
tion procedure  such  as  the  Newton's  method  could  be 
used  at  each  time  step.  For  schemes  whose  order  of 
accuracy  is  <2,  iterative  solutions  can  be  avoided 
by  a  local  linearization.  By  using  a  local  Taylor 
series  and  the  consistency  condition  for  (4)  it  is 
easy  to  show  that 

(7)  p(E)fn  =  Jn+Pp(E)un  +  0(At2)  , 

where  J  is  the  Jacobian  matrix  J  =  3f/3u  and  p 
is  an  integer  in  the  range  0  <  p  <  k.   With  this 
approximation,  (6)  becomes 

(8)  (I-uAtJn+P)p(E)un  =  At  [0(E)-  wp(E)]fn 

which  is  a  second-order-accurate  linear  equation 
for  p(E)un. 

(b)  Quasl-one-leg  method.   The  evaluation  of  the 
right-hand  side  of  (8)  can  be  reduced  to  a  single 
function  evaluation  by  noting  that 


where 


[o(E)  -  ujp(E)]fn  =  f(u)  +  0(At?)  , 


U    \'-(Y.)    -  wp(E)]un 


This  is  a  variant  of  one-leg  methods  [3]  correspond- 
ing to  the  LMM  (4).   Hence  (8)  becomes 

(9)    (I  -  u)AtJn+P)p(E)un  =  Atf(u)  +  0(At3)  . 

Example:   The  linear  one-step  method  p(E)  =  E  -  1, 
o(E)  =  9E  +  (1  -  6),  o(E)  -  o)p(E)  =  1,  is  A-stable 
for  9  >  1/2  and  is  second-order  accurate  when 
6  =  1/2.   For  this  example,  algorithm  (9)  becomes 


(I 


6AtJ  )(u 


u  ) 


Atf(un) 


This  noniterative  ODE  method  was  suggested  indepen- 
dently by  Lomax  [9]  and  Liniger  and  Willoughby  [8]. 
This  scheme  is  equivalent  to  solving  (4)  by 
Newton's  method  and  stopping  the  iteration  after 
one  step. 

(c)  Approximate  factorization.   For  PDE  applica- 
tions it  is  convenient  to  split  (3)  as 


(10) 


dt  '  f(u) 


fj(u)  +  f2(u) 


The  Jacobian  is  then 


3u 


3f, 


3f, 


(11)  J 
and  the  algorithm  (9)  becomes 

(12)  [I-u)At(Jn+p  +  J2+P)]p(E)un  =  Atf(u)  . 
An  approximate  factorization  of  (12)  yields 

(13)  (I-uAtJ^+P)(I-uAtJ2+P)p(E)un  =  Atf(G)  . 

It  is  shown  in  [13]  that  this  approximate  factoriza- 
tion does  not  upset  either  the  second-order  temporal 
accuracy,  or  the  unconditional  stability  of  the 
method  when  applied  to  linear  test  equation 
du/dt  =  Xiu  +  A2u  (ReX!,A2  <  0).   The  A-stability  is 
preserved  because  we  have  chosen  p(E)un  as  the 
unknown  variable. 

For  simplicity,  we  illustrate  the  PDE  applica- 
tion by  considering  only  the  hyperbolic  equa- 
tion (1).   The  algorithm  for  the  complete  equa- 
tion (2)  will  be  given  in  the  final  manuscript. 
Comparing  (10)  and  (1),  we  identify  the  differen- 
tial operators 


(14) 


fj(u) 


3F(u) 
3x 


f,(u) 


3G(u) 
3y 


The  Jacobians  (11)  are  replaced  by  the  operators 

3 


(15) 


'1 


3x 


3y 


B 


where  A  =  3F/3u,  B  =  3G/3u  are  Jacobian  matrices. 

(For  the  equations  of  gas  dynamics  in  two  spatial 

dimensions,  these  are  4><4  matrices.)   Inserting 

(14)  and  (15)  in  (13)  yields* 

(16)   (i  +  ooAt  £  An4 


)(l  +uAt  ^Bn+P)p(E)un 

=   »tpF(u)  +  5G(u)] 
L  3x      3y  J 


An  obvious  computational  sequence  to  implement  (16) 
as  an  ADI  method  is 


*In  (16),  notation  of  the  form 

3 


denotes 


(I  +  uAt  ■£-   B)  p(E)u 
dy 


p(E)u"  +  uAt  ^  (Bp(E)u11) 
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(17a)   (i  +  uAt  ^An+PJp(E)u* 

I  a*  3y  J 

(17b)  (l  +  -.At  ^  Bn+P)D(E)un  =  o(E)u*  , 


(17c)    u 


where  o(E)u*   is  a  dummy  temporal  difference.   The 
spatial  derivatives  appearing  in  (17)  are  to  be 
approximated  by  appropriate  finite-difference  quo- 
tients.  For  three-point  spatial  difference  approx- 
imations, the  x-  and  y-operators  on  the  left  side 
of  (17a, b)  each  require  the  solution  of  a  block- 
tridiagonal  system  of  equations  with  each  block 
having  dimensions  m  *  m,  where  m  is  the  number  of 
components  of  the  vector  u.   There  is  an  obvious 
extension  of  (17)  to  three-spatial  dimensions. 

In  practice,  two-step  A-stable  methods  are  of 
primary  interest.   The  class  of  all  two-step  methods 
which  are  at  least  second-order  accurate  can  be 
written  as 


(18)   (1  +  Ou 


n+2 


(1  +  2?)un+1  +  Cu" 


=  At  [efn+2  +  (e  -  29  +|)  fn+1   -  (5  -  6  +   f  )  fn]  , 

where  9,£  are  arbitrary  real  numbers.  By  applying 
the  theory  of  positive  real  functions  [4],  one  finds 
that  (18)  is  A-stable  if  and  only  if 


5  <  29  -  1  , 


S>--f 


MIXED  SPATIAL  DERIVATIVES.   The  appearance  of  mixed 
spatial  derivatives   3z/3x3y  on  the  right-hand 
side  of  (2)  precludes  an  efficient  reduction  of  a 
multidimensional  problem  into  a  product  of  one- 
dimensional  operators.   This  difficulty  can  be 
avoided  by  treating  the  mixed  derivatives  with  an 
explicit  LMM  which  is  "compatible"  with  an  A-stable 
LMM.   We  demonstrate  the  rather  remarkable  result 
that  this  can  be  done  and  still  retain  unconditional 
stability.   The  only  penalty  is  that  the  parameter 
space  for  A-stability  is  slightly  reduced. 

NUMERICAL  COMPUTATIONS.   Finally,  we  give  a  brief 
survey  [1,11,12]  of  recent  two-  and  three-dimensional 
flow  computations  made  using  ADI-LMMs  of  the 
form  (17). 
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TOLERANCE  PROPORTIONALITY 
IN  ODE -CODES 

Hans  J.  Stetter 
Tech.  Univ.  Vienna 


1 .  Meaning  of  Proportionality 

Most  routines  for  the  numerical  solution  of 

(1  .1  )    v' (t)  =  f(t,y(t)),    y(0)  =  y  , 

t  £  [D,T],    y(t)  £]RS  , 

request  the  specification  of  an  accuracy  level 
through  a  tolerance  parameter  6  €  IR.  The  choice  of 
6  determines  the  computation  for  a  given  problem 

(1 .1 )  so  that  all  computational  quantities  are  de- 
pendent upon  6 . 

Originally,  the  numerical  solution  of  [1  .1 ] 

by  a  given  code  is  only  defined  on  a  "grid"  G(6)  = 

{t  ;  n  =  0(1  )N}  c:  [0,T] ;  by  a  suitable  interpolation 
n 

(see  section  2)  we  may  generate  an  approximation 

n(t,6}  defined  for  all  t€  [0,T],  with  global  error 

e(t,6)  :=n(t,6)  -y(t).  Tolerance  proportionality 

means  that 

(1.2)  e(t,6)  «*<5  •  vCt), 

with  a  function  v:  [0,T]-»1R   independent  of  6, 
holds  for  an  interval  of  6-values  including  those 
commonly  used.  (1 .2)  permits  an  assessment  of  e 
from  the  results  of  two  computations  with  differ- 
ent tolerances  6 . 

A  vaguer  form  of  proportionality  occurs  in 
the  Toronto  Assessment  Programs  ([1],  [2]).  Here 
the  efficiency  of  a  code  in  finding  an  approxi- 
mate solution  of  a  given  accuracy  for  a  given 
problem  (1  .1 )  at  a  given  point  t  is  measured  on 
the  basis  of 


2 .  Local  equivalent  of  proportionality 

The  numerical  solution  of  (1 .1  )  by  a  one-pass 
forward  step  method  cannot  be  controlled  with  a 
global  strategy.  But  (1  .2),  with  v  unspecified,  is 
not  a  global  requirement,  at  least  not  for  small  6: 
Within  first  order  perturbation  theory  it  is  equi- 
valent to  ( cf . , e .g . , [3] )  the  local  requirement 
that  n(t,<5)  satisfies 

(2.1)    n'(t,6)*«f(t,n(t,6))  +6  •  y(t), 

with  y:  [0.T]  -»1R   independent  of  6.  (2.1  )  can  be 
controlled  on  a  step-by-step  basis. 

For  specified  values  n   .  and  ri  6  IR  ,  there 
n-1      n 
s 

is  a  unique  vector  u  £  IR   such  that  the  solution 
of 


z'(t)  =  f(t,z(t)  )  +u  ,  z(t  , 
n     n-1 


'n-1' 


satisfies  z(t  )  =  n  .  We  assume  that o  ur  continuous 
n     n 

approximation  n(t,6)  has  been  determined  in  this 

fashion;  u  may  be  called  its  "backward  error"  in 
n 

(t   „,t  ),  cf.  [4].  (2.1)  requests 
n-1   n 

(2.2)    u  (6)  «6  •  y(t)  in  [t   ,,t  ]. 
n  n-1   n 

It  is  well-known  (see  [3]  or  [4])  that  u 
and  the  "local  error  per  unit  step"  L  are  re- 
lated by 


(2.3) 


U  (1  +0(h  )); 
n       n 


thus  proportionality  requires  L  to  be  proportional 
to  6.  n 


(1.3)   lle(t.6)ll«6  «v(t), 

where  e  and  v(t)  are  determined  by  a  logarithmic 
least  squares  fit. 

In  the  following,  "proportionality"  will 
refer  to  the  stricter  relation  (1  .2)  unless  spe- 
cified. We  will  regard  various  effects  in  cur- 
rent ODE-codes  for  their  impact  on  the  achieve- 
ment of  proportionality. 


For  a  code  "of  order  p",  with  L  =  0(h  ),  this 
1/p  n     n 

implies  h   =  0(6   p) .  Thus,  even  if  we  manage  to 

control  the  computation  properly  w.r.t.  leading 

powers  of  h,  we  must  expect  (1  +0(6    )) -factors 
in  the  proportionality  relation  (1.2)  if  we  re- 
place the  ra-sign  by  equality.  Therefore  we  need 
not  bother  about  (1  + 0( 6) ) -factors  which  arise 
from  linearization  w.r.t.  6  in  (2.1)  or  from  the 
evaluation  of  quantities  along  y(t)  in  place  of 
n(t,6). 


We  assume  that  the  actual  control  of  the 

computation  is  effected  through  a  vector  L    €  ]R 

n 

which  is  computed  in  each  step:  On  the  basis  of 
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L     and  5,    an  acceptance  procedure   determines  wheth- 


er  to   accept   the   current   step  or  not,    and  a   step- 
.  and  order)    control   procedure   calculates   the 
stepsize    (and  order)    to   be   used   in   the   next    Cor  re- 
peated)   step. 


3.    Local   Error  Estimator  Effects 

In  a  one-step  method  of  order  p,    we   may  com- 
pute L      from  any  procedure  Lit    „  ,n„_ , -h_jf )   which 


n 
satisfies 


n-1'  'n-T  V 


(3.1)        L(t,y(t).h,f)    =     hP(p(t+h)M+0(h)) 


xaptur  Lffacts 

We   assume  that  the   acceptance  procedure   deci- 
des on   the  basis 


(4.1) 


^        $6        accept 
HLnll 

>  6   reject 


where  II.  ..II  may  include  a  weighting,  e.g.  by  the 
current  solution  values.  (The  type  of  weighting  and 

the  choice  of  ]R  -norm  are  supposed  to  be  independent 
of  6.) 

If  we  have  a  local  error  estimation  of  type 
(3.1),  we  have 


if  we  only  achieve 

(3.2)  L   =  6  •  *(t  )(1  +0(h  )) 

n         n        n 

through   our  control.    For  a  backward  error  u     of 
order  p.    (3.1)/(3.2)    implies 

(3.3)  u      =   hPip(t    )    (1  +  0(h    )) 

n  n  n  n 


=   hP<p(t    )  diag 
n  n 


=   5y(t    )  diag 


Vpitn); 
UnrrJ 


(1  +0(h     )) 
n 


(1  +0(h 


(4.2) 


IMt 


<p(t    )(1  +0(h    )  ), 

n  n 


thus   the   fact   that  we    consider   a   norm  only    in    (4.1) 
does   not    hurt,    and   the  weighting  may   as   well    be   as- 
sumed  to   use   true   solution    values    (cf.    section    2) 
and  thus    to   be   completely   independent   of   6. 

If  we    could   deduce   ||L   II  «    8    from   (4.1),    (4.2) 
n 

would  immediately   yield   the    relation    (3.2)   which 

was   assumed   in    section    3.    Actually,    the   fact   that 

the   usual   acceptor  does   not  contain   a   lower  bound 

on   IlL   II    may    seriously    impair  proportionality: 


=   6y(t    )(1  ♦CKh    1). 
n  n 

which  is   our   requirement    (2.2).    The   difficulty 
which   may  arise   if  a   component   of  ip  vanishes   will 
be   considered  in   section   6.    Note   that   the  error 
estimate    (3.1)    need  not  be   asymptotically   correct, 
i.e.    ip  may   differ  from  the   principal   error  func- 
tion ip. 

In   a  multistep  PC-method,    the  above   analysis 
remains   valid  only   if   at   each   step   change   new  back 
values   are   computed  by  a   sufficiently  accurate   in- 
terpolation procedure    (of  order  p+1).    If  the   pre- 
vious  grid  remains   unchanged,    the   expression   for 
oaokward  error  becomes 

(3.4)        u      =    [P„(h    )<p„(t    )  +P.(h    )ip_(t    )]    (1+0(h)) 
n  1n1n  2n2n 

2 
where  the  polynomials  P  (h)  of  degree  p  have  h  - 
m 

terms  as  their  lowest  powers  so  that  u  =  0(h  )  if 
the  previous  steps  remain  fixed;  cf.  e.g.  [5], 
Similarly,  the  commonly  used  local  error  estima- 
tors behave  like  P(h  )<p(  t  ),  with  P(h  )  =  D(h2)  but 
A  n    n  n       n 

P*P  ,  m  =  1,2.  Only  if  we  use  an  asymptotically 

correct  local  error  estimator,  which  is  far  more 

expensive,  we  will  have  a  parallel  behavior  of  u 

and  A  .  n 

n 

Thus,  in  a  customary  variable  step  multistep 
code  (like  STEP)  even  if  we  would  manage  to  keep 
L  strictly  proportional  to  5  this  would  not  be 

fully  true  for  u  .  As  long  as  no  violent  step  chan- 
ges occur,  the  effects  are  not  dramatic;  but  they 
add  an  other  element  of  uncertainty  to  the  use  of 
"fictitious"  (viz.  lower  order)  local  error  esti- 
mates. 


Consider  a  high  order  code  which  is  conserva- 
tive in  increasing  the  stepsize  so  that  values  of 
|L  ||  around  0.1  6  do  not  lead  to  a  change  in  step- 
size  for  a  long  interval  of  integration.  In  this 
case  a  change  of  6  by  a  factor  of  100  may  not  chan- 
ge the  computation  at  all  and  the  global  error  re- 
mains insensitive  to  changes  of  6  over  a  wide  ran- 
ge of  6-values.  When  6  is  further  increased,  the 
global  error  may  suddenly  jump  to  a  much  larger 
value . 

Thus  the  inequality  in  (4.1)  will  permit  an 
undermining  of  strict  proportionality  while  it  may 
not  seriously  affect  the  vague  proportionality  men- 
tioned in  section  1. 


5.  Stepsize  Control  Effects 

A 

Since  the  acceptor  will  not  prevent  1 1 1 II  from 

n 

assuming  values  far  smaller  than  5,  the  stepsize 
control  must  be  designed  such  that  ||L  II  is  kept 

close  to  6.  Actually,  in  view  of  (4.1),  (4.2),  and 
(3.2),  a  proper  strategy  is  to  aim  at 


(5.1) 


|L  II  «*  r&.  r  <  1  fixed, 

n 


For  one-step  methods  it  can  be  proved  that  the 
following  procedure  implies  (5.1)  under  the  assump- 
tion of  (3.1),  at  least  for  smooth  ll<p(t)lh 

2 


(5.2) 


h 


IlL 


old' 


this   remains   true  even   if   rejections   occur  due   to 
a   growth   in   lltpll  . 
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Actually,  (5.2]  is  the  usual  strategy  for  one- 
step  codes  where  the  stepsize  may  bs  changed  virtu- 
ally without  cost;  special  provisions  are  added  for 
very  small  and  very  large  values  of  6/11 L  II.  Thus, 

at  least  at  this  fundamental  point,  strict  propor- 
tionality is  supportable  by  a  reasonable  code. 

For  multistep  methods  which  use  interpolation 
at  a  stepsize  change,  the  same  strategy  (5.2)  would 
be  indicated;  but  due  to  the  cost  involved  one  will 
tend  to  introduce  a  threshhold  for  changes  of  h. 
Such  treshholds  may  seriously  disturb  the  achieve- 
ment of  (5.1],  particularly  for  high  order  methods. 


and  3  on  the  relative  location  of  the  singularity 
w.r.t.  the  current  computation,  the  proportionality 
to  6  which  would  otherwise  follow  from  (6.3)  (inter- 
preted as  h  a||Af||»6)  is  weakened.  But  it  is  sur; 
sing  that  such  a  strong  trace  of  proportionality 
be  shown  to  remain  even  after  a  jump  in  f. 

Another  problem  dependent  difficulty  was  po: 
ted  out  in  section  3:  components  of  (p   may  pass 
through  zero.  Due  to  the  restriction  to  a  norm  if 
(4.1)  and  (5.2)  this  becomes  fully  effective  only 
for  scalar  problems  (1.1);  but  some  damage  may  also 
appear  when  a  strongly  dominant  component  of  the 
local  error  estimate  changes  sign  in  a  system  (1.1). 


For  multistep  methods  which  adapt  their  co- 
efficients to  the  local  grid  structure,  we  have  the 
difficulty  exposed  in  section  3  (cf.  (3.4)).  Here 
it  is  unknown  what  a  strategy  maintaining  (5.1) 
should  look  like,  even  if  we  could  neglect  the  cost 
of  changing  h  in  each  step.  The  usual  strategies 
applied  in  such  codes  attempt  to  keep  the  stepsize 
constant  over  longer  intervals  and  use  (5.2)  if  a 
change  has  become  necessary.  It  seems  that  this 
tends  to  maintain  "vague  proportionality"  in  the 
sense  of  section  1.  Certainly,  strict  proportiona- 
lity cannot  generally  be  expected  under  these  cir- 
cumstances . 


B.  Problem  Effects 


So  far,  we  have  assumed  that  the  problem  (1.1) 
is  sufficiently  smooth  to  let  the  order  of  the  me- 
thod become  effective,  see   (3.3)  and  (3.1).  Let  us 
now  consider  what  happens  if  the  computation  cros- 
ses a  finite  jump  discontinuity  in  f . 

Assume  that  the  local  error  estimate  is  based 
on  a  lower  order  local  solution  so  that 


(6.1) 


h  1  , 
n  n 


A     ,  p-1 , 
with    1   =  0(|-i    ). 
n      n 


In  a  wide  class  of  codes,  1   is  essentially  a  lin- 

n 

ear  combination  of  evaluations  of  f;  if  f  has  a 

jump  along  the  solution  trajectory,  1   consists  of 

a  "smooth  part"  which  would  have  arisen  with  a 
smooth  continuation  of  f  and  the  contribution  from 
the  jump  in  f: 


(6.2) 


h  (1  +  a  Af ), 
n  n 


a  is  some  factor  which  may  depend  on  the  position 
of  the  jump  relative  to  the  step. 

A 

If  h  has  to  be  reduced  because  of  ||L  II  >6, 

~       n        d-1  n 

L  decreases  like  h    and  acceptance  occurs  essen- 
*  n 

tially  for 
(6.3) 


h  a  IIAf  II  si  6 
n 


Similarly,  the  contribution  of  this  step  to 

the  global  error  consists  of  two  parts:  The  effect 

of  the  "smooth"  backward  error  (normally  very  small 

due  to  the  smallness  of  h  )  and  an  immediate  incre- 

n 

ment  in  e(t)  due  to  a  wrong  consideration  of  Af. 

This  second  error  is  of  the  form  h  BAf,  its  effect 

on  the  global  error  further  on  is  thus  proportional 

llAfll.  Due  to  the  irregular  dependence  of  o 
n 


Consider  the  scalar  case,  with  ip(t)  »  a(t-t)  in 
the  vicinity  of  t.  Then  (5.2)  and  (3.1)  imply 


n-1 


t  -t 
n 


n-1 


so  that  the  stepsize  will  grow  locally  until  t  >t; 
but  the  details  of  that  growth  will  depend  on  the 
relative  location  of  t  in  the  grid  and  thus  on  6  in 
an  irregular  manner.  Thus  there  will  be  large  u   in 

the  vicinity  of  t  ( cf .  (3.3)  and  note  that  ip(t)  *  0), 
but  their  size  will  generally  not  be  proportional  to 

6 .  This  may  introduce  a  considerable  perturbation 
of  strict  proportionality. 


7.  Starting  Effects 

The  choice-of -starting-step  dilemma  is  well- 
known:  On  the  one  hand,  it  is  unwise  -  and  contrary 
to  the  intentions  of  numerical  software  design  -  to 
delegate  this  choice  to  the  user;  on  the  other  hand, 

the  values  of  y   and  f(t  ,y  )  are  simply  not  suf- 
o        o  ■'o 

ficient  to  let  the  code  choose  a  reasonable  first 

step,  cf .  e.g.  [6  ] . 

From  the  point  of  view  of  achieving  proportio- 
nality, it  is  extremely  undesirable  to  have  an  in- 
itial interval  in  which  the  stepsize  (and  hence  the 
local  error)  is  considerably  smaller  that  the  speci- 
fied tolerances  would  require;  cf.  section  4.  The 
following  suggestion  seems  to  be  practical: 

Design  a  procedure  which  computes  an  h.  from 
y  ,  f(t  ,y  ),  and  6  which  would  be  the  proper  step- 

size  if  some  heuristic  assumptions  about  the  problem 
were  true.  Execute  the  first  step  and  test  for  ac- 
ceptance. Repeat  the  step,  with  a  new  stepsize  com- 
puted from  (5.2),  not  only  if  IlL.II  >  6   but  also 
if  IlL.II  <  c  6,  with  a  reasonable  value  c  <  1.  De- 
tails may  be  found  in  [6]. 

Such  a  procedure  will  prevent  ill-effects  from 
unduly  small  starting  stepsizes.  The  same  procedure 
should  also  be  used  for  restarting  after  a  singular 
condition,  like  a  jump  in  f,  has  been  recognized  by 
the  code: 

-4 
Assume  that  h  has  to  be  reduced  to  10   in  or- 
der that  a  discontinuity  in  f  may  be  passed  but  that 
the  smoothness  of  f  after  the  jump  permits  steps  of 
order  10"''  under  the  specified  tolerance.  With  or- 
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itepsize  control  strategies,  the  computation 
many  steps  to  return  to  the  appropriate 
stepsize.  Therefore  a  restart,  even  if  tt  involves 
a  repetition  of  the  first  step  will  be  much  more 
jmic  besides  enhancing  proportionality. 


which  is  indicated  by  the  bi         in  fig.  . 

without  the  (1  +  0(h))-fa<  jrs  gem.  r 

under  (8.1)  and  (8.2)       .own.  It  may  be  ai 
that  the  differences  are  0(h  +      it  this  i 
leading  power  on  the  level     of  the  newly  genera- 
ted error  per  step.  -, 


8.  Output  Effects 

So  far  we  have  assumed  that  steps  are  chosen 
exclusively  to  comply  with  local  error  require- 
ments (except  in  "emergency  situations").  In  many 
codes,  however,  these  requirements  are  overridden  by 
the  requirement  to  reach  output  points  precisely.  If 
there  is  a  majority  of  steps    determined  in  this 
fashion,  proportionality  must  obviously  become  com- 
pletely obsolete. 

But  even  if  we  consider  output  at  the  end  T  of 
an  integration  interval  only,  the  situation  is  not 
trivial.  A  common  and  natural  strategy  is  the  fol- 
lowing: Determine  h    by  the  local  error  vs.  toler- 
ance requirements;     use  the  stepsize 


(8.1) 
h 


- 


r  T-t      if  T  € (t  ,t  +h  _  ], 
n  n  n   n+1 

(T-t  )/2  if  T€(t  +h  _,t  +2h  A, 

n  n  n+1   n    n+1 


Also  in  codes  where  the  output  values  are  ob- 
tained by  interpolation,  care  has  to  be  taKen  if 
strict  proportionality  is  not  to  be  abandoned  in 
the  output  process.  Fig. 2  shows  the  source  of  the 
difficulty.  Some  related  analysis  may  be  found  in 
[  ]. 


e  (t .  5)' 


fig.  2 


n+1 


if  T?t  +2h   „, 
n    n+1 


Thus  normally  the  last  two  steps  before  T  are  mo- 
dified. 

A  simple  analysis  shows  that  -  from  the  point 

of  view  of  proportionality  -  it  is  better  not  to 

looK  ahead  beyond  t  +h  „ : 
n   n+1 


F  -t    if  T€(t  ,t  +h  J, 

n  n  n   n+1 


(8.2)    hn+1= 


fig.  1 


n+1 


if  T  £  t  +h  „ 

n   n+1 


newly 
generated 
error  at  T 

A 


rvM 


n+2 


This  is  best  explained  by  fig.1.  Assume 


.  <T 

- 


t  <h    +h   ,  where  h  ..    and  h  ._  are 
n   n+1   n+2        n+1      n+2 


defined  by  local  error  requirements.  According  to 

1/(2.2),  the  newly  generated  error  in  [t  ,T] 
should  be  n 


(8.3) 


6(T  -  tn)  Y  (T)  (1  +0(hn+1)) 


9.  Mild  Stiffness  Effects 

We  do  not  wish  to  consider  here  the  chances 
for  proportionality  in  the  application  of  a  "stiff 
code"  to  stiff  systems  (1.1).  There  are  some  par- 
ticular aspects  of  this  situation  which  require  a 
special  analysis:  A  constant  error-per-unit-step 
control  seems  not  to  be  advisable  (see  [7])  and  the 
damping  of  transient  errors  may  be  stronger  for 
large  h  in  L-stable  methods. 

Here  we  consider  a  typical  "non-stiff",  i.e. 
explicit  code  and  a  mild  stiffness  in  (1.1).  At  a 
crude  tolerance  which  permits  relatively  large  steps, 
our  basic  assumption  (3.3)  on  the  backward  error 
in  a  p-th  order  method  may  no  longer  hold.  The  ty- 
pical situation  in  a  multistep  method  is  shown  in 
fig. 3.  (only  real  negative  eigenvalues  are  consid- 
ered) . 

If  the  local  error  requirement  applied  to  the 
principal  root  C  permits  hA  < ( hA)  for  the  dominant 
or  for  a  subdominant  component,  a  "parasitic"  con- 
tribution to  the  error  will  grow  until  its  effect 
is  large  enough  to  affect  the  stepsize  control.  Al- 
though the  reaction  of  the  control  will  not  be  ap- 
propriate (since  it  is  geared  on  a  hD-behavior) , 
hX  will  eventually  move  to  the  safe  interval 
hA>(hA).  Here  the  local  error  is  too  small  so  that 
h  is  increased  again.  The  same  effects  occur  in  one- 
step  methods  if  we  consider  the  one  (principal) 
root  for  a  dominant  and  a  (mildly  stiff)  subdominant 
component  of  the  solution. 

We  may  say  that  in  such  a  situation  the  com- 
putation is  controlled  by  stability  rather  than  by 
accuracy.  Since  the  stability  requirement  has  no- 
thing to  do  with  the  specified  tolerance  6,  a  com- 
putation which  is  predominantly  controlled  by  sta- 


10-4 


bility  cannot  be  expected  to  yield  a  global  error 
proportional  to  6. 


We  may  even  define  a  given  problem  (1.1)  to 
be  stiff  for  a  given  non-stiff  code  at  a  specified 
tolerance,  if  the  computation  is  no  longer  control- 
led by  local  error  requirements  throughout.  In  a 
high-order  Adams  PECE-code  stiffness  in  this  sen- 
se should  be  very  common  as  a  quantitative  analy- 
sis shows: 


Define  (hX)  by 


hA 


(9.1)      IS      (hA) I  2  .9  e   for  hA  < (nX; 
paras. 


and  let  us  characterize  the  location  of  (hX)  by 
the  corresponding  number  of  "locally  correct  di- 
gits" 


In  a  "variable  procedure"  code  we  can,  of 
course,  not  assume  that  the  same  procedure  is  used 
in  a  certain  section  of  the  integration  interval  for 
different  <5 .  This  affects  our  considerations  at 
their  roots: 

In  section  3,  we  were  able  to  permit  the  fic- 
titious local  error  estimate  (3.1),  with  ipfcp,  be- 
cause we  could  assume  (p(t)  to  be  essentially  inde- 
pendent of  6;  thus  a  function  y  was  generated  in 
(2.2)  which  was  also  essentially  independent  of  6. 
If  there  may  be  different  functions  (p  for  different 
<5 ,  the  argument  in  section  3  breaks  down. 

Furthermore,  even  if  we  have  an  asymptotically 
correct  error  estimator  and  achieve  II  u  II  w  6  for  all 
n,  this  does  not  imply  (2.2)  in  the  variable  pro- 
cedure case.  Different  vectors  Y(t)  of  identical 
norm  may  lead  to  quite  different  solutions  of  (2.1). 

Thus  there  is  not  much  to  expect  with  regard  to 
proportionality  from  a  variable  procedure  code,  no 
matter  whether  it  uses  local  extrapolation  or  not. 
On  the  other  hand,  this  does  not  mean  that  propor- 
tionality, at  least  of  the  vague  type,  cannot  occur 
with  such  codes:  If  there  is  sufficient  regularity 
in  the  relation  between  the  various  high  order  deri- 
vatives, the  resulting  function  value  Y(t)  in  (2.2) 
and  (2.1)  may  not  depend  strongly  on  the  particular 
procedure  which  is  used  in  the  neighborhood  of  t. 

Actually,  it  has  been  experimentally  estab- 
lished that,  e.g.,  the  variable  order,  variable 
step  multistep  code  STEP  which  also  uses  local  ex- 
trapolation exhibits  a  considerable  degree  of  vague 
proportionality  for  a  wide  variety  of  problems. 


rTTTi    (hX) 
?   •    (hA J  -  e 
10.     princ. 
-  log  — ^— 


(hA) 


11 .  Conclusions 


Then  a  situation  is  stiff  for  a  K-step  Adams 
PECE-code  (p=K+1)  if  the  tolerance  requests  fewer 
than 


K         4 
locally  correct 


digits 


1  .5  2.5  3.75   5 


B.5 


10 
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10.  Change  of  order  (method) 

So  far  we  have  made  the  silent  assumption 
that  the  tolerance  parameter  6  influences  the 
computation  through  the  selection  of  the  stepsize 
only.  However,  some  of  the  most  efficient  codes 
are  based  on  varying  order  or  varying  basic  inte- 
gration procedures. 

This  is  well-known  for  Adams  multistep  codes; 
the  recent  construction  of  families  of  embedded 
Runge-Kutta  procedures  ([9])  suggests  a  similar 
approach  for  one-step  methods.  This  approach  is 
further  supported  by  results  which  demonstrate 
that  lower  order  methods  are  more  efficient  at  cru- 
de tolerances  (  [l]  j  . 


The  preceding  considerations  should  have  made 
it  clear  that  the  output  of  a  robust  and  efficient 
code  for  initial  value  problems  (1.1)  cannot  be  ex- 
pected to  exhibit  more  than  the  vague  proportiona- 
lity between  the  global  error  and  the  specified 
tolerance  6  which  has  been  described  in  section  1 . 
An  exponent  e » 1  in  (1.3)  should  appear  only  if  the 
least  squares  fit  is  based  on  a  wide  range  of  to- 
lerances . 

If  6  is  varied  only  over  a  narrow  range,  with 
many  samples  taken,  say  6  =  10~31-  ■  '  '    ,  and  if  the 
values  of  the  individual  error  components  are  re- 
garded in  place  of  the  norm  of  the  error  vector, 
we  should  not  be  surprised  if  few  traces  of  propor- 
tionality are  observable  before  smoothing. 

This  means  that  we  cannot  reliably  judge  the 
size  of  the  global  error  from  running  the  problem 
twice  with  different  tolerances.  But  this  was  the 
main  advantage  that  we  hoped  to  obtain.  If  we  can- 
not have  it  we  may  as  well  forget  about  proportio- 
nality, at.  least  form  a  utilitarian  paint  of  view. 

Even  if  our  codes  were  more  perfectly  propor- 
tional, the  ordinary  user  would  certainly  not  bother 
to  run  them  twice  at  different  tolerance  levels  if 
they  would  give  him  an  approximate  value  of  the 
global  error  directly.  I  do  not  believe  that  he 
would  be  very  fastidious  w.r.t.  the  accuracy  of  this 
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iate  value;  the  order  of  magnitude  might  be 
t  in  many  cases.  He  might  even  accept  an 
-;ional  blunder  without  much  fuss.  (Precisely 
rehavior  is  shown  by  the  users  of  quadrature 
lea  where  the  reliability  of  the  error  assessment 
-.  tter.  ) 

If  we  admit  that  an  assessment  of  the  global 
ir  should  be  incorporated  into  ODE-codes,  the 
tion  is  whether  it  can  be  done  more  efficiently 
:-ing  the  codes  more  proportional  or  by  a  di- 
rect approach.  From  my  experiences.  I  would  favor 
the  second  choice;  see  [10]. 

Nevertheless,  the  analysis  of  the  proportiona- 
lity properties  of  ODE-codes  yields  important  in- 
sights into  the  mechanism  of  these  codes. 
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Introduction. 

In  many  technical  applications,  model  problems 
are  defined  by  systems  of  ordinary  differential  e 
quations 


f(t,y; 


t  e  [o,t] 


y  e  R 


(D 


where  n  is  a  large  number,  and  where  the  Jacobi- 
an  matrix 


-fil 


i,j  =  1,2,. 


,,n 


(2) 


is  a  sparse  matrix.   In  this  paper  we  shall  discuss 
the  problem  of  selecting  a  solution  method  and  show 
how  sparse  matrix  techniques  can  be  used  to  solve 
the  systems  of  linear  equations  that  arise,  and  how 
it  may  be  implemented  efficiently. 

Solution  methods. 

The  systems  ( 1 )  that  concern  engineers  will  usu- 
ally be  characterised  by  having  a  wide  spectrum  of 
eigenvalues  for  the  Jacobian  J,  and  these  will  of- 
ten vary  along  the  solution.   This  means  that  only 
methods  for  stiff  systems  can  be  used  in  general 
purpose  software.   On  the  other  hand  the  engineer 
will  often  have  very  modest  requirements  of  accu- 
racy.  In  such  cases  methods  of  constant  order  will 
be  reasonably  efficient,  for  the  present  discussion 
we  have  selected  order  two. 

Often  it  will  be  advantageous  for  the  stepsize 
strategy  to  take  large  steps  in  spite  of  the  stiff 
character  of  the  system,  therefore  it  is  important 
that  the  region  of  absolute  stability  of  the  method 
contains  points  at  infinity  in  the  left  half  plane, 
and  that  damping  of  errors  in  the  stiff  components 
will  take  place.   In  some  applications  one  would 
also  be  interested  in  damping  of  highly  oscillatory 


components . 

These  objectives  can  be  achieved  if,  in  the  case 
of  one-step  methods  we  have  L(a )-stability ,  while 
in  the  case  of  linear  multistep  methods  we  could 
use  the  following  L,  (a  )-stability  definition:  (see 
ref.fi]). 

Definition:   A  linear  k-step  method 

k  k 

I   a.y   .  =  h  I   B.f  .. 

j=o  J  n+J   j=o  J  n+J 

is  L,  (a)-stable  if  (i)  it  is  A(a)-stable,  and  if 
k 

applied  to  the  testequation  y'  =  Xy  ,  Re(X)  <  0  , 
it  yields 


|yn+kl  <*(h*)  ma*|y11+il 


0  <  i  <  k-1  , 


such  that   |R(hX)|  -»  0  when   |x|  -»  »  , 
|arg(-X)|  <  a  . 

Of  the  methods  that  satisfies  the  above  require- 
ments we  have  considered  two  types,  the  Backward 
Differentiation  methods  and  Semi-Implicit  Runge- 
Kutta  methods,  for  order  two  we  will  use  the  fol- 
lowing formulae. 


n+1 


U       1  2 

3  n   3  n-1     3  n+1 


(3) 


for  error  estimation,  an  approximation  to  y'"  may 
be  found  by  using  backward  differencing.   The  SIRK- 
methods  are  of  N0rsett-type  (see  ref.[2]). 

k,   =  f(t  +yh,y  +Thk.  )  (U) 

1        n     n    1 

\       =   f(tn+62h,yn+621klh+yhk2) 

k3   =  f(tn+63h,yn+631k1h+632k2h+Yhk3) 

V1  =  yn  +  ^VWWV  (5) 

LE   =  h(c1-k1+c2-k2+c3-k3)  (6) 
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on. 

h  variable  stepsize  implementation  the  step- 
.  are  in  general  controlled  by  the  estimated 
local  errors.   In  the  stepsize  selection  strategy 
the  error  constants  will  influence  the  stepsizes 
chosen  in  such  a  way  that  the  smaller  the  error 

jtant  is, the  larger  the  stepsize  will  be  chosen. 
In  our  comparison  the  SIRK  will  have  a  smaller 

r  constant  than  the  BDF  method.   If  the  same 
error  tolerance  is  prescribed  we  may  expect  the 
SIRK  to  do  better  on  average  than  the  BDF  be- 
cause the  number  of  steps  will  be  smaller.  On  the 
other  hand,  Ngrsett  has  shown  (ref.[2]),  that  it 
is  possible  to  select  the  parameters  in  C*,5»6)  in 
such  a  way  that  the  k   found  in  step  n  can  be 
used  as  k   in  step  n+1  ,  so  that  only  two  k.'s 
have  to  be  found  in  each  step,  except  when  starting 
and  when  changing  stepsize. 

Each  of  the  formulae  (3)  and  (h)   is  a  nonlinear 
system  of  equations  that  has  to  be  solved  in  each 
step,  this  solution  is  in  general  solved  by  means 
of  quasi-newton  iterations,  that  take  the  following 
forms , 

in  the  BDF  case: 
in  the  SIRK  case: 


Vi  *2 


-■— ►  t 


(I-YhJ)e[*]=kU]-f(y    +vhkjs]) 

11  n  i 

kU+l]=k[s]-e[s] 
i  li 


(9) 

(10) 


The  matrices  A   will  be  changed  when  either 

the  stepsize  is  changed  or  when  the  convergence  is 

slow  and  the  Jacobian  updated.   If  our  system  is 

large  the  factorizations  of  the  matrices  A   will 

r 

be  very  expensive  and  the  main  part  of  the  total 
work  involved  is  spent  on  that.  If  sparse  tech- 
niques is  used  the  work  can  be  reduced  but  it  is 
essential  that  as  much  information  is  carried  a- 
long  about  the  earlier  factorizations. 

Application  of  sparse  techniques. 

The  subroutine  SSLEST  has  been  constructed  so 
as  to  facilitate  the  solutions  in  such  cases.   By 
using  ordered  lists  for  storing  only  nonzero  ele- 
ments and  by  selecting  pivots  so  that  the  number  of 
fill-in's  is  small,  each  factorization  is  made  very 
efficient.   Further  if  as  in  the  case  above  the  se- 
quence of  matrices  A   all  have  the  same  structure, 
r 

then  the  information  in  the  pivotal  strategy  may  be 
used  in  subsequent  factorizations,  thereby  saving 
a  great  amount  of  work,  in  general  this  amount  will 
be  about  60%  of  the  total  factorization.  This 
sparse  matrix  package  has  been  used  in  connection 
with  the  SIRK  method  in  a  subroutine  SPARKS 
that  will  be  presented,  and  compared  to  the  use  of 
BDF  methods . 

Conclusions. 


In  each  case  J  is  an  approximation  to  the  Jaco- 
bian, this  is  only  updated  in  case  the  convergence 
is  too  slow.   It  is  convenient  to  consider  the  pro- 
cess written  in  the  form 


Ar  6n-1  "  bn+1 


(11) 


where  A  =  (I  -  YhJ )  and  b   ,   is  one  of  the 

r  n+1 

right  hand  sides  (7)  or  (9)  both  formulae  have  this 

general  form.   If  we  consider  the  time  history  of 

the  solution  we  will  find  a  sequence  of  matrices 

A   ,  r  ■  0,1, that  can  be  displayed  in  the 

form 


The  use  of  sparse  matrix  techniques  in  connec- 
tion with  methods  for  stiff  systems  for  ODE's 
with  sparse  Jacobians,  can  be  very  effective.   Two 
types  of  methods  have  been  compared,  the  BDF 
methods  versus  the  SIRK  methods ,  both  are  suf- 
ficiently stable  to  allow  the  use  of  large  step- 
sizes.   Efficient  implementations  with  a  sparse 
package  is  presented. 
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Abstract 


Most  methods  for  solving  stiff  systems  are 
based  on  implicit  formulas  and  require  the  use  of 
Newton-like  iterations.   The  cost  of  the  matrix 
operations  in  the  iteration  scheme  of  these 
methods  can  be  quite  high. 

A  new  iteration  scheme  is  developed  which 
exploits  the  structure  of  the  system  and  also 
allows  fast  updating  of  the  iteration  matrix 
after  a  stepsize  or  order  change.   The  technique 
is  particularly  useful  when  the  stiffness  is  due 
to  only  a  few  components  of  a  large  system,  and 
is  applicable  to  most  methods  for  stiff  systems. 
It  is  based  on  an  automatic  partitioning  of  the 
system  into  subsystems  corresponding  to  stiff  and 
non-stiff  parts.   This  partitioning  is  then  ex- 
ploited in  the  solution  of  the  system  of  equations 
that  must  be  solved  on  each  time  step. 

1.   Introduction 

In  most  methods  for  solving  stiff  systems, 
the  solution  of  a  system  of  equations  on  each 
time  step  by  a  modified  Newton  iteration  usually 
consumes  a  considerable  amount  of  the  compu- 
tational effort  of  the  method. 

One  way  to  achieve  efficiency  is  to  take 
advantage  of  the  structure  of  the  problems  and 
develop  techniques  to  solve  problems  of  the  same 
structure.   For  example,  problems  with  banded  or 
sparse  Jacobians  can  be  solved  efficiently  by 
methods  with  iteration  schemes  based  on  banded 
or  sparse  (linear  system  solvers),  rather  than 
using  general  methods  (see  Hindmarsh  [1976]). 

If  the  system  can  be  partitioned  into  stiff 
and  non-stiff  parts  (transient  and  smooth  com- 
ponents), more  effective  techniques  may  be 
developed  to  reduce  the  cost  of  the  iteration 
scheme.   The  idea  of  partitioning  the  system 
into  transient  and  smooth  parts  is  an  old  idea 
and  is  related  to  the  singular  perturbation 
problem,  where  the  system 

y(t)  -  f(y(t)),  y(0)  =  yQ  (1.1) 

is  transformed  into 


Du 


f1(u,v)) 
f2(u,v)i 


(1.2) 
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where 

D  is  a  matrix  which  may  be  singular 
u  corresponds  to  transient  components 
v  corresponds  to  smooth  components. 

As  early  as  1952,  Dahlquist  used  a  method 
called  Pseudo  Stationary  Approximation  (PSA)  in 
which  the  transient  components  are  eliminated  by 
putting  D=0,  thus  reducing  the  system  to  a  dif- 
ferential algebraic  system.   The  integration  can 
then  be  done  by  any  conventional  ODE  solver. 
This  method  was  revised  by  Dahlquist  [1969]  and 
analysed  and  tested  by  Oden  [l97l].   The  new 
version  is  called  SAPS  (for  smooth  approximate 
particular  solution) . 

Partitioning  of  linear  systems  of  the  form, 
y  =  Ay  +  b(t),  where  A  is  a  matrix  with  constant 
elements  and  with  the  large  eigenvalues  well 
separated  from  the  small  eigenvalues,  has  been 
considered  by  Lee  [1967]  and  by  Branin  [1967]  in 
connection  with  circuit  theory. 

Lee  used  matrix  filtering  to  divide  the  system 
into  loosely  coupled  subsystems,  one  for  the 
smooth  components  and  one  for  the  transients. 
The  smooth  and  transient  components  are  obtained 
by  using  the  transformation 

y1   =  LP  y,  y2  =  HP  y; 

where  LP  and  HP  are  matrices  which  act  like 
filters.  LP  passes  the  eigenvectors  of  A  that 
correspond  to  small  eigenvalues  and  annihilates 
eigenvectors  that  correspond  to  large  eigenvalues. 
HP  has  the  opposite  effect.   The  integration 
procedure  he  used  is  similar  to  the  PSA  method 
where  the  transient  components  are  eliminated 
(by  putting  y„  =  0)  and  the  smooth  components 

are  integrated  by  a  conventional  method. 

Branin  suggests  that  the  matrix  A  be  de- 
composed into  a  sum  of  diagonal  and  off-diagonal 
parts.   If  the  diagonal  part  has  large  eigen- 
values and  the  off-diagonal  part  has  small  eigen- 
values, the  restriction  on  the  stepsize  for  the 
formula  he  used  (which  is  based  on  an  analytical 
solution)  is  relaxed.   He  noted  that,  for  most 
practical  problems,  the  matrix  A  has  to  be  trans- 
formed first  so  it  can  be  decomposed  in  this  way. 
Branin  suggests  using  some  cycles  of  the  QR- 
algorithm  to  do  the  transformation.   He  also 
gives  guidelines  to  handle  nonlinear  systems, 
where  the  Jacobian  matrix  plays  the  role  of  the 
matrix  A,  but  is  changing  during  the  course  of 
the  integration.   In  this  case  the  transformation 
has  to  be  frequently  updated. 
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Methods  based  on  implicit  formulas  have  been 
widely  used  for  solving  stiff  systems.   However, 
using  these  methods  may  not  be  efficient  for  large 
systems  in  which  the  stiffness  is  caused  by  only 
a  few  components  of  the  system.   The  cost  of 
dealing  with  the  Jacoblan  of  the  full  system  can 
be  quite  high.   Dahlquist  has  pointed  out  that 
more  work  should  be  done  on  designing  software 
which  can  utilize  the  favorable  situation  which 
occurs  when  some  subsystems  are  not  stiff 
(Dahlquist  [1974]). 

Alfeld  and  Lambert  [197  7]  report  on  a  method 
where  the  correction  is  only  done  in  the  'domi- 
nant' space  (the  space  spanned  by  the  transient 
components).   They  consider  systems  of  the  form, 
y  =  A(t)y  +  g(t),  where  the  eigenvalues  of  A(t) 
may  be  separated  into  two  parts,  one  of  which 
dominates  the  other.   The  integration  consists 
of  taking  one  forward  step  by  a  conventional 
multistep  method  and  then  making  a  correction  in 
the  subspace  spanned  by  the  eigenvectors  corres- 
ponding to  the  dominant  eigenvalues.  The 
technique  requires  the  explicit  computation  of 
the  dominant  eigensystem.   Application  to  non- 
linear systems  is  also  discussed  where  the 
eigensystem  of  the  Jacobian  will  be  needed  at 
each  step. 

Recent  work  on  partitioning  stiff  system  has 
also  been  reported  by  Hoffer  [1976].  Hoffer's 
method  is  designed  to  solve  large  systems  in 
which  the  transient  components  originate  from 
only  a  few,  easily  identified  equations.   His 
integration  formula  is  a  combination  of  the  modi- 
fied midpoint  rule  applied  to  the  subsystem  of 
the  smooth  components,  and  the  implicit  trape- 
zoidal rule  applied  to  the  transient  components. 

In  the  partitioning  techniques  described 
above  the  difficulty  of  solving  linear  systems  of 
equations  on  each  time  step  is  reduced  somewhat 
by  applying  an  analytic  expansion  or  implicit 
formula  (with  the  modified  Newton  iteration)  only 
to  the  stiff  subsystem  and  applying  a  standard 
non-stiff  formula  to  the  remainder  of  the  system. 
The  accuracy  and  stability  of  such  techniques  is 
very  sensitive  to  the  correct  identification  of 
the  corresponding  subsystem  as  well  as  to  the 
amount  of  coupling  that  is  present  between  these 
subsystems.  It  should  also  be  noted  that,  even 
for  linear  problems,  the  appropriate  partitioning 
will  in  general  be  time  and  tolerance  dependent. 
The  technique  we  propose  is  based  on  applying  the 
same  implicit  formula  to  all  components  of  the 
system  and  exploiting  partitioning  only  in  the 
iterative  solution  of  the  resulting  systems  of 
equations.   As  a  result  only  the  cost  of  the 
Iteration  will  be  sensitive  to  the  correct  iden- 
tification of  the  corresponding  subsystem  and 
the  accuracy  and  stability  of  the  integration 
will  not  be  sensitive  to  the  partitioning. 

In  a  related  study,  Robertson  [1976]  has  con- 
sidered partitioning  of  the  Iteration  matrix  in 
the  iteration  scheme.   By  fixing  the  part  of  the 
iteration  matrix  which  corresponds  to  the  smooth 
components  and  only  updating  the  part  which  cor- 
responds to  the  transient  components,  the  iter- 
ation scheme  will  result  in  using  a  modlfied- 
Newton  technique  only  on  transients  and  repeated 
substitution  with  correction  terms  for  smooth 
components.   By  allowing  periodic  updating  of  the 
part  corresponding  to  the  smooth  components 
further  efficiency  can  be  achieved.   Robertson 


does  not  discuss  how  one  can  obtain  the  appropri- 
ate partitioning. 

Techniques  that  achieve  efficiency  in  solving 
stiff  systems  without  pre-information  about  the 
transient  and  the  smooth  components  are  more 
general  and  can  easily  be  incorporated  into 
existing  packages  for  stiff  systems.   The  fast 
update  technique  developed  by  Enright  [1978]  Is 
of  this  kind.   The  iteration  matrix  is  reduced 
to  Hessenberg  form  and  subsequent  updates  of  the 
representation  of  the  iteration  matrix,  after 
stepsize  or  order  changes,  require  fewer  oper- 
ations than  working  with  the  inverse  of  the 
iteration  matrix,  or  its  LU  factors. 

The  technique  to  be  developed  and  analyzed 
in  this  study  is  related  to  Enright 's  approach. 
As  the  iteration  matrix  is  reduced  using  a 
sequence  of  similarity  transformations,  a  test 
is  performed  to  identify  the  stiff  components. 
The  resulting  partitioning  of  the  system  leads 
to  an  iteration  scheme  which  adapts  to  the 
behaviour  of  the  problem  and  can  be  interpreted 
as  applying  a  modified  Newton  iteration  to  the 
transient  components  and  repeated  substitution 
to  the  smooth  components.   (The  idea  is  actually 
similar  in  spirit  to  the  identification  and 
solution  of  rank  deficient  least  square  problems 
by  the  method  described  in  Lawson  and  Hanson 
[l97A,pp. 77]) .   Substantial  saving  of  computa- 
tional operations  can  be  expected,  especially  if 
the  stiffness  is  due  to  only  a  few  components  of 
a  large  system. 

2.   The  Proposed  Technique 

The  application  of  an  implicit  multistep 
formula  to  the  system 


y  =  f(y),  y(tQ)  -  yQ, 

can  be  written  as 

0  =  G(y1+1)  =  yi+1-h60f(y,+1) 


(2.1) 


i+r 


■E  Yi+i-j  "  h£ 


3-i 


3=1 


3yi+l-3* 


(2.2) 


An  iteration  scheme  for  solving  the  nonlinear 
system  G(yi+1)  ■  0,  can  be  viewed  as: 


Predict  y   , 


set  6 


i+1 


'i+1 


yi+l  for  l   =  1'2 and 


determine  <5.   by  solving  the  linear  system 


i+1 


W6 


fc-1. 


i+1 


G(y~+p  for  I   -  1,2,.... 


(2.3) 


Different  iteration  schemes  will  result  from 
different  choices  of  W.   For  example,  if  W  is  the 
identity  matrix  then  the  iteration  corresponds  to 
repeated  substitution  or,  in  our  context,  a 
predictor  corrector  iteration;  while  if 
W  =  [l-h6  A]  where  A  ■  3f/3y  evaluated  at 

y.  (3  *  1+1).  the  iteration  corresponds  to  a 

modified  Newton  scheme. 

If  (2.1)  is  stiff  then  there  is  an  obvious 
trade-off  between  these  two  possible  choices  of 
W.  The  former  choice  is  cheap  (0(n)  operation 
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per  iteration)  but  in  order  to  guarantee  con- 
vergence the  stepsize  must  be  severely  restricted 
(| |hBQA| |<1).   On  the  other  hand,  the  latter 
choice  is  expensive  (0(n~)  operation  per  iteration) 

and  0(n  )  operations  when  W  has  to  be  updated), 
but  much  larger  stepsizes  are  possible.  Basically 
the  scheme  we  describe  is  a  compromise  between 
these  two  choices  of  W.   Our  idea  is  to  approxi- 
mate the  matrix  A,  of  the  Newton  iteration  by  a 
matrix  A  that  will  permit  efficient  iterations 
and  updates,  and  at  the  same  time,  include  enough 
information  for  the  iteration  to  converge. 

Before  we  discuss  the  technique  we  should 
review  the  technique  suggested  by  Enright  [1978] 
for  implementing  the  modified  Newton  iteration. 
He  observed  that  if  the  system  is  of  dimension  n 
and  W  ■  [i-hB-AJ  is  retained  in  its  LU  factored 

form  then  0(n  )  operations  will  be  required  wher- 
ever the  scalar  hBn  changes.   On  the  other  hand 

-1 
if  W  is  retained  in  its  LHL   factorization  (H  is 

an  upper  Hessenberg  matrix),  then  fast  updates 
are  possible  when  h6_  changes  and  only  when  A 

needs  to  be  updated  will  W  require  a  full  update. 
In  order  to  motivate  how  we  choose  X,  let  us 
rewrite  the  modified  Newton  iteration  as: 


accomplish  this  by  making  ||R,9||  small 


(A  -  (l/hB0)I)<5*+1  =  (l/hB0)G(y^+J) 


(2.4) 


Consider  the  factorization  of  A  by  the  similarity 
transformation: 


A  =  S  R  S 
where  R  is  of  the  form 


(2.5) 


H  R 
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\°  R22/ 
and,  H  is  an  k*k  upper  Hessenberg,  R..  -  is  an 
kx(n-k)  matrix,  R--  is  an  (n-k)*(n-k+l)  matrix. 

The  matrix  R  is  of  the  same  structure  as  the 
reduced  matrix,  A,  ,  at  the  k-th  step  in  the  re- 
duction of  A  to  upper  Hessenberg  form.   That  is, 
if  the  reduction  were  done  using  Householder 
transformations  then 

Ak"Uk-lUk-2---UlAUl"-Uk-2Uk-l 

where  U  ,  I  =  l,...,k-l  are  elementary 
reflectors. 

We  modify  this  approach  by  introducing 
pivoting  (row  and  column  interchange  on  each 
step).   R  is  then  the  reduced  matrix  at  the 
k-th  step: 

R  =  Vipk-i---uipiAPiur--pk-i\-i        (2-6) 


if 


R  by  che  matrix 


|  is  small  enough,  then  we  can  replace 


M2 


(2.7) 


and  A  can  then  be  replaced  by 

X  =  SRS-1. 

Note  that  ||A-A||F  <  ||R22MF. 

Returning  to  the  system  (2.4)  and  substituting 
A  for  A  we  get 

(A-(l/hB0)I)6*+1  =  S(R-(l/h80)l)s"16j+1  = 

(l/hB0)G(yJ;J)  (2.8) 

A  decomposition  algorithm  has  been  developed 
which  applies  the  transformation  as  in  (2.6).   The 
interchange  of  rows  and  columns  is  designed  so 
that  the  columns  of  R  „  with  large  norms  will  be 

the  pivot  columns.   The  value  of  k  is  determined 

as  the  smallest  integer  such  that  ||R0-||_  is  less 

22   r 

than  a  specified  threshold  value,  T.   The  cost  of 

this  decomposition  is  approximately  10/3  n  k 

2      3 
-7/3  nk  +4/3  k  adds  and  multiplies. 

The  matrix  H  is  updated  by  adding  (-1/hB-Jl 

and  the  resulting  matrix  is  factored  into  its  LU 

2 
decomposition  at  a  cost  of  k  /2  adds  and 

multiplies.   Then,  to  solve  (2.8)  we  must  apply 
the  following  four  steps: 

"c 


1.  form  the  vector  c 


2.  set  z2  =  -(hBQ)c2 


L°2j 


B^tt/h  0)G(yJ   ) 


3.  solve  Ly 


4  .  solve  U 


Cl  '  R12Z2 


5.  set  & 


i+1 


The  cost  of  solving  one  system  is  then  approxi- 
mately 5  n  k  -  5/2  k  adds  and  multiplies. 

In  the  remainder  of  this  paper  we  will  discuss 
the  selection  of  the  threshold  value,  T  and  the 
application  of  this  technique  to  existing  methods. 
We  will  also  present  some  numerical  results 
illustrating  the  performance  of  this  technique. 


where  P   is  a  permutation  matrix. 


Let  P  U  =  S  .   Then  the  matrix  S  in  (2.5) 


will  be 

S  -  S  .  .  .S 

1    k-1 

-1    T 
and  S   -  S  . 

One  would  prefer,  in  our  application,  that 

the  k*k  matrix,  H,  correspond  to  the  transient 

components.   The  interchanges  are  designed  to 
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IMPLEMENTATION  OF  IMPLICIT  FORMULAS 
FOR  THE  SOLUTION  OF  ODES 

L.  F.  Shampine 

Applied  Mathematics  Research  Department 

Sandia  Laboratories 

Albuquerque,  New  Mexico 


The  material  presented  in  the  talk  appears 
in  written  form  as  the  report  SAND79-069A  of  the 
same  title.   Copies  may  be  obtained  from  the 
author  on  request. 

Implicit  formulas  are  quite  popular  for  the 
solution  of  ODEs.   They  seem  to  be  necessary  for 
the  solution  of  stiff  problems.   Every  code  based 
on  an  implicit  formula  must  deal  with  certain 
tasks  studied  in  this  paper:   (i)  A  choice  of 
basic  variable  has  to  be  made.   The  literature  is 
extremely  confusing  as  to  what  the  possibilities 
are  and  the  consequences  of  the  choice.   These 
matters  are  clarified.   (ii)  A  test  for 
convergence  of  the  method  for  solving  the 
implicit  equations  must  be  made.   Ways  of 
improving  the  reliability  of  this  test  are 
studied.   (iii)  Deciding  when  to  form  a  new 
approximate  Jacobian  and/or  iteration  matrix 
is  a  crucial  issue  for  the  efficiency  of  a  code. 
New  insight  which  suggests  rather  specific 
actions  will  be  developed. 

The  report  closes  with  an  interesting 
numerical  example.   It  was  created  to  illustrate 
in  a  simple  way  a  difficulty  observed  in 
practice.   It  also  shows  that  in  very  reasonable 
circumstances,  (i)  the  implicit  formula  can  have 
more  than  one  solution,  (ii)  the  predictor  can 
be  closer  to  the  "wrong"  solution,  and  (iii)  any 
convergence  test  based  solely  on  the  difference 
of  successive  iterates  can  accept  an  approximate 
solution  from  a  divergent  iteration. 
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TAYLOR  SERIES  METHODS  WITH 
VARIABLE  STEPSIZE  RATIO  AND  VARIABLE  ORDER 
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Taylor  series  methods  compute  a  solution  to 
an  initial  value  problem  in  ordinary  differential 
equations  by  expanding  the  solution  in  a  long 
series.   This  solution  is  extended  across  the 
desired  interval  by  analytic  continuation.   We 
present  techniques  for  choosing  the  largest  ratio 
of  the  integration  stepsize  to  the  series  radius 
of  convergence  (Re)  which  may  be  taken  subject  to 
error  control  constraints.   This  optimal  ratio 
depends  on  the  length  of  the  series,  the  orders 
and  locations  of  the  primary  singularities,  the 
desired  error  tolerance  (e)  and  the  type  of  error 
measure  the  user  wishes  to  control.   Then  a 
series  length  can  be  chosen  which  minimizes  the 
computational  cost  of  solving  the  problem. 

Most  state-of-the-art  codes  for  solving 
ODE's  vary  both  the  stepsize  and  the  order  of 
the  formula  used  in  order  to  handle  efficiently 
a  variety  of  problems  and  a  range  or  error  toler- 
ances.  Several  authors  have  written  translator 
programs  which  accept  the  statement  of  the  system 
of  ODE's  to  be  solved  and  produce  object  code 
which  can  run  later  to  solve  the  system  using 
series  methods.   The  lengths  of  the  series  used 
varied  from  11  [1]  up  to  32  [5],  but  none  of 
these  codes  attempted  to  determine  the  most 
efficient  series  length  at  run  time.   We  have 
successfully  solved  examples  using  series  as  long 
as  1000  terms.   Chang  [2]  gave  an  example  for 
which  20  terms  was  found  empirically  to  be  a  good 
length.   Each  author  varied  the  stepsize  to  keep  it 
substantially  less  than  some  estimate  for  Re  to  con- 
trol the  error.   By  using  a  relatively  long  (20  to 
30  terms)  series,  Chang  was  able  to  give  an  accur- 
ate estimate  of  Re.   Then  a  step  of  approximately 
1/CH-l) 


Re-  e 


was  used. 


The  choices  of  an  optimal  stepsize  and  series 
length  require  the  accurate  estimation  of  Re  from 
tests  Involving  several  consecutive  terms  of  the 
series  outlined  in  [3]. 

Singularities  in  the  solutions  to  real  ODE's 
can  occur  with  any  order,  but  they  must  occur 
either  on  the  real  axis  or  in  conjugate  pairs. 
Without  loss  of  generality,  the  solution  y(x)  may 
be  expanded  at  x  -  0  with  series 

OO  CO 

Z   y(J)(0)  trVj!  -  E  Y   .  We  assume  for  the 
j-0  j-0  J+1 

moment  that  there  is  only  one  singularity  on  the 
circle  of  convergence.   Solutions  with  more  than 


one  primary  singularity  are  considered  later  in 
this  paper.   If  the  distance  to  the  closest  secon- 
dary singularity  is  R2 ,  then  the  contribution  from 
the  secondary  singularities  to  the  series  is 
0(|Rc/R2|N)  as  N  +«.   Hence  the  effects  are  neg- 
ligible if  a  series  is  long  enough.   Consequently 
we  consider  the  model  problem 
v(z;a,s)  =  (a  -  z)-s,  for  s  t   0,  -1,  ...  . 
N 


Let  v(z;a,s) 


Z   V,  be  expanded  using  a 
j=l  j 


step  which  satisfies  0  <  r  =  h/a  <  1.   Let  N  be 

i  u      ~j  c      |N-l+s.hi   . 

large  enough  to  satisfy  — <1, 

N   a 


v(z) 


N 
Z   V 


j* 


d  = 


N-l+s 
N   ' 


rV(1 


r), 


and  B 


rdVN/(l 


rd).   Then  [A]  showed  that 


0  <  A  i  TN  1  B>  for  1  £  s;  and  B  <  T  <A,  for 

1 >  s.   A  and  B  depend  on  the  h  which  was  used 
to  expand  the  series,  but  the  series  can  easily 
be  shifted  to  a  new  h*  by  multiplying  the  j-th 

term  by  (h*/h)  J~1. 

We  wish  to  choose  an  optimal  stepsize  ratio 
U  =  h*/a  as  large  as  possible  while  maintaining 
one  of  the  error  measures: 

Type  i)   absolute  truncation  error  per  step, 

|TNN  or 
Type  ii)  absolute  truncation  error  per  unit  step, 
|TN/h|; 

less  than  a  prescribed  tolerance,  e.   Relative 
or  mixed  error  measures  are  handled  in  a  similar 
manner.   For  Type  i)  error  if  s  <  1,  then  U  is 
the  single  real  root  on  [0,  1]  of 


V. 


P(x) 


N 


N-l 


Let  6 


«   /vN. 


N  j. 
x  +  ex 


Then^U 


1.1/N 

N    )  is  an 


approximate  root  of  p(x),  so  we  use  it  as  an 
estimate  for  U.   Estimates  for  other  error  types 
can  be  computed  similarly,  and  some  are  given 
below.   Figure  1  shows  an  example  of  how  these 
estimates  depend  on  the  tolerance,  c,  and  on  the 
series  length,  N. 
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1 

Error 

Estimated  Optimal 

Measure 

Stepsize  Ratios. 

Type  i) 
s  <  1 

61/N(1-|61/N) 

s  >  1 

(f)1/NU-f  61/N) 

Type  ii) 
s  <  1 

1 
(a6)N-1(l  -  NTi(a6) 

1 
■-1, 

s  >  1 

1 
.a6.N-l..    d  .   x. 
<*->    (1  "  FT  U6) 

1 

Table  1.   Optimal  Stepsize  Ratios 

We  have  shown  how  to  compute  an  optimal 
stepsize  ratio.   Next  we  consider  the  choice  of 
the  series  length.   The  codes  produced  by  the 
various  translator  programs  use  recurrence  rela- 
tions to  generate  the  series.   Fast  series 
generation  techniques  have  not  yet  been  found 
suited  to  this  application,  so  each  integration 
step  requires  aN2  +  3N  +  y   floating  point  opera- 
tions.  The  approximate  cost  of  solving  the 
entire  problem  is  proportional  to 

qN2  +  gN  +  y 
U 
As  an  example,  the  code  produced  by  Chang's  ATS 

2 
translator  for  y"  =  y  -  (y')  requires  approxi- 

2 
mately  1.5N  +  15N  +  100  floating  point 

operations  per  step.   The  translator  programs 

can  be  modified  to  supply  their  object  codes 

with  the  values  for  a,  6,  and  y   so  that  C(N)  can 

be  evaluated  at  run  time  to  choose  an  optimal 


C(N) 


series  length,  N 


opt 


As  experience  with  other 


methods  has  indicated,  problems  with  a  more 
stringent  error  tolerance  are  solved  more 
efficiently  by  higher  order  methods  (longer 
series).   As  indicated  by  Figure  2,  C(N) 
typically  has  a  broad  minimum,  so  the  exact 
choice  of  N  is  not  critical.   In  practice,  a 
series  length  longer  than  N    should  be  used 

because  the  increased  accuracy  of  the  estimates 
for  Re  from  a  longer  series  balances  the  in- 
creased cost  of  its  generation. 

The  preceeding  discussion  assumed  that  the 
solution  to  the  ODE  had  only  one  primary  sing- 
uoarity.   If  the  solution  has  a  conjugate  pair 
(a  and  ~a)    of  primary  singularities  of  order  s, 
then  its  series  is  asymptotic  to  EF . ,  the  series 


for  f (x)  =  (a  -  z)  8  +  (a  -  z)  S. 


j' 

If  EV 


is  the 


series  for  v(z)  ■  (|a|  -  z) 

2V   cos(s+J)9,  where  6  = 
truncation  error  for  f  it 


then  F, 


j+l 

arg  a.   Hence  the 
Ls  at  most  twice  the 
truncation  error  for  v  and  the  estimates  for  U 
given  in  Table  1  for  a  single  primary  singularity 
remain  valid  for  a  conjugate  pair  of  primary 


singularity  remain  valid  for  a  conjugate  pair  of 
primary  singularities,  provided  that  £  is  re- 
placed by  e/2. 


50        100 

SERIES  LENGTH 


150 


200 


FIGURE  1.   OPTIMAL  STEP  FOR  ABSOLUTE 
TRUNCATION  ERROR  PER  UNIT  STEP. 
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FIGURE  2 

Y"  =  Y 

p  =   15,  AND 


.   COMPUTATIONAL  COST  FOR 
-  (YV   WITH  06=  1.5, 


7  =  100, 


We  have  shown  how  to  choose  dynamically  a 
series  length  which  approximately  minimizes  the 
computational  cost  of  solving  a  system  of  ODE's 
by  series  methods.  The  optimal  ratio  of  the 
stepsize  to  the  series  radius  of  convergence  has 
been  given.   These  choices  depend  on  series 
length,  locations  and  orders  of  primary  singulari- 
ties, error  tolerance,  and  the  type  of  error 
measure  the  user  wishes  to  control.  Work  is  in 
progress  with  Roy  Morris  to  incorporate  the  ideas 
in  this  paper  into  a  suitable  translator  program. 
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In  this  talk,  three  versions  of  a  new  extrapolation  method  for  stiff  systems  of  ODEs  are  presented. 

Let 

(1)  y'  =  f(t,y)     ,     y(o)  =  yQ 

denote  the  stiff  initial  value  problem.  Then  the  main  idea  is  to  rewrite  the  system  in  the  form 

(D     y'  -  Ay  =  f(t,y)  -  Ay  =:  f(t,y)  , 

with 

A  «  fy(o,yo) 

and  to  discretize  the  above  left-hand  side  separately  -  but  not  in  terms  of  a  sophisticated  approximation 
of  the  associated  matrix  exponential.  Rather,  in  view  of  the  extrapolation  to  be  combined  with  the  dis- 
cretization, the  crude  approximations  (stepsize  h) 

(2)  E(Ah)  =  I  +  Ah,  E(-Ah)~'  =  (I  -  Ah)"' 

turn  out  to  be  sufficient.  The  main  difficulty  in  the  construction  of  such  an  extrapolation  method  is  to 

2       .  .  2 

assure  both  the  existence  of  an  asymptotic  h  -expansion  (that  permits  one  to  use  h  -extranolation)  and 

desirable  stability  properties  (that  make  the  method  efficient  for  stiff  ODEs) . 

The  semi-implicit  mid-point  rule  to  be  presented  here  has  been  derived  from  applying  the  explicit  mid-point 


rule  (GBS-method)  to  the  equation  (1')  using  the  exact  matrix  exponential   exp(Ah)   and  replacing  it  de- 
liberately by  either  E(Ah)   or  E(-Ah) 

(3)   a)   semi-implicit  Euler  starting  step: 


liberately  by  either  E(Ah)   or  E(-Ah)   .   With  (2),  the  following  scheme  is  obtained  (compact  version) 


Ao  :=  (I-hA)   hf(to,yo) 


b)   semi-implicit  mid-point  steps: 


k  =  1  ,  .  .  .  ,  I  -   1 


\   :=  Vi  +  Vi 

\   im   Vl  +2n-hA)-'  [hf(tk,nk)  -  Ak_,] 


c)   final  smoothing  step: 


Vi  +  Vi 


Vl  +  H-hA)"1  [hf  (t?,nv)  -  &     ,  J 
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■■  n4_,  ♦  a£  . 

2 
With  S  ■:  yh(t)  .  *h  ■  t   fixed,  the  existence  of  an  h  -expansion  is  shown.  For  stability  reasons,  t"4ni  , 

m  integer,  is  chosen. 

On  the  basis  of  algorithm  (3)  and  of  detailed  investigations  in  terms  of  S-stability,  two  further  schemes 
have  been  developed.  In  one  of  these  schemes,  the  starting  step  (3a)  is  replaced  by  an  implicit  Euler  step. 
In  the  other  scheme,  the  smoothing  step  (3c)  is,  in  addition,  repeatedly  applied  at  every   t  .  .  The  itera- 
tive application  of  that  smoothing  step  leads  to  a  special  realization  of  the  following,  fully  implicit 
discretization  scheme: 


n.  :-no+hf(t|,n1) 
n2  :-  r\^   +  h  f  (t2,n2) 


,  ....  2m- I 


n2k+l  :"  2  n2k  "  n2k-l  "  n2k  +  hf  (t2k'  n2k> 
n2k+2  :=  n2k+l  +  hf  (t2k+2'  n2k+2}  ' 

This  scheme  may  be  regarded  as  an  implicit  trapezoidal  rule  on  a  2h-grid: 

n2k+2  "  n2k  =  h  [f(t2k'  n2k}  +  f(t2k+2'  n2k+2}  ] 

started  by  two  implicit  Euler  steps  on  an  h-grid.  The  intermediate  quantities  r\  are  only  used  to  con- 

struct useful  starting  points  for  a  Newton-like  iteration  to  solve  the  implicit  step  (details  omitted  here). 

Moreover,  in  the  implicit  steps,  an  additional  stepsize  restriction  is  derived  on  the  basis  of  the  associ- 

2 
ated  affine  invariant  convergence  theorem.  Once  more,  an  h  -expansion  exists. 

The  three  above  sketched  extrapolation  methods  have  been  tested  extensively,  among  other  examples  also  by 
means  of  the  well-known  sample  set  STIFF  DETEST.  The  results  seem  to  indicate  that  the  semi-inplicit  mid- 
point rule  in  combination  with  the  implicit  Euler  starting  step  is  the  preferable  choice.  All  of  the  methods 
compare  quite  well  with  the  code  GEAR  (due  to  Hindmarsh) .  Detailed  comparisons  will  be  given. 
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WHAT  MIGHT  ODE  SOLVERS  BE  TESTED  FOR? 


T.E.  Hull 
Department  of  Computer  Science 
University  of  Toronto 
Toronto,  Ontario,  Canada 


Outline 

I  have  found  it  convenient  to  divide  the 
material  of  this  talk  into  three  parts.   The  first 
part  is  intended  to  provide  a  brief  summary  of  the 
testing  that  took  place  during  the  "classical" 
period  of  the  1950's  and  1960's.   The  second  is 
intended  to  provide  a  similarly  brief  summary  of 
the  "modern"  period,  the  1970's.   The  third  and 
more  extensive  part  consists  of  some  speculation 
about  what  we  might  look  forward  to  in  a  "future" 
period  of  testing  of  ODE  solvers. 

The  emphasis  is  primarily  on  testing  in  the 
experimental  sense,  that  is,  on  testing  programs 
for  solving  ODEs  on  a  computer.   However,  testing 
in  the  theoretical  sense  is  included  as  well,  as 
for  example  in  testing  mathematically  that  a  cer- 
tain class  of  formulas  is  asymptotically  unstable, 
since  it  seems  to  me  that  the  two  aspects  of  test- 
ing complement  one  another  and  are  best  considered 
together. 

In  a  relatively  brief  survey  of  such  a  large 
field,  it  is  impossible  to  be  complete,  and  in 
particular  to  attempt  anything  like  a  complete 
bibliography.   However,  several  recent  papers  and 
books  do  contain  substantial  lists  of  references. 
I  therefore  decided  to  omit  references  altogether 
from  this  summary,  and  only  mention  in  the  text 
itself  a  number  of  authors'  names  and  dates.   Those 
that  are  mentioned  are  intended  to  be  reasonably 
representative,  but  by  no  means  complete. 

The  next  section  of  this  summary  is  devoted 
to  the  first,  or  classical  stage  of  the  1950-60's. 
This  stage  is  characterized  primarily  by  a  study 
and  comparison  of  mathematical  formulas,  as 
opposed  to  anything  like  what  we  would  now  call 
mathematical  software.   The  mathematical  results 
obtained  during  this  period  were  quite  substantial, 
however,  and  had  very  important  practical  impli- 
cations, particularly  with  regard  to  the  stability 
properties  of  formulas  (first  for  non-stiff,  and 
later  for  stiff  problems).   Experimental  testing 
provided  some  important  results  as  well,  for 
example  in  pointing  to  the  possible  importance  of 
variable  order  Adams  formulas,  but  the  experiments 
were  relatively  modest  comparisons  of  various  formu- 
las using  fixed  step  sizes. 

Then  a  further  section  of  this  summary  is 
'  <-d  to  the  second,  or  modern  stage  of  the 
1970'b.   This  stage  is  characterized  by  a  shift  in 


emphasis  to  relatively  large  scale  testing  of 
"real"  computer  programs  over  fairly  substantial 
classes  of  problems.   There  were  also  a  number  of 
important  theoretical  contributions  related  to  the 
testing  of  various  approaches  during  this  period, 
for  example  with  regard  to  the  stability  of  methods 
for  stiff  systems,  but  the  emphasis  in  testing  had 
certainly  shifted  towards  large  scale  experiments. 
Based  on  the  results  obtained,  a  number  of  conclu- 
sions have  been  reasonably  clearly  established 
about  the  reliability  and  efficiency  of  various 
classes  of  computer  programs,  particularly  for  non- 
stiff  problems,  but  also  to  a  limited  extent  for 
stiff  problems  as  well.   However,  the  developments 
and  conclusions  attributable  to  this  stage  brought 
with  them  many  new  and  interesting  questions  about 
what  we  should  be  trying  to  accomplish  when  we  test 
ODE  solvers. 

The  final  section  of  this  summary  is  intended 
to  help  clarify  a  few  of  these  questions,  and 
perhaps  provide  some  suggestions  about  what  steps 
we  might  take  to  resolve  some  of  them.   It  is  clear 
that  our  objectives  must  be  more  clearly  defined, 
that  we  must  broaden  the  classes  of  problems  over 
which  the  tests  are  performed,  and  above  all  that 
our  testing  must  reflect  as  closely  as  possible  the 
real  needs  of  the  ultimate  user,  for  example  in 
terms  of  such  factors  as  convenience,  robustness, 
flexibility,  and  so  on.   Such  changes  will  of 
course  reflect  back  in  turn  on  design  objectives 
for  the  development  of  good  numerical  software  for 
solving  ODEs. 

Classical  stage 

As  already  indicated,  the  important  develop- 
ments of  this  stage  appear  to  be  more  theoretical 
than  experimental.   The  beautiful  results  establish- 
ed by  Dahlquist  in  the  mid  1950's  can  be  considered, 
from  the  point  of  view  of  testing,  as  providing  a 
definitive  test  for  determining  which  multistep 
formulas  are  asymptotically  stable  and  which  are 
not . 

Then  in  1963  Dahlquist  "did  it  again",  when 
he  published  a  paper  on  A-stability  which  provided 
the  impetus  for  much  of  the  subsequent  growth  of 
interest  in  methods  for  solving  stiff  problems. 
The  ideas  and  the  criteria  were  essentially  theore- 
tical.  Other  theoretical  developments  were  also 
taking  place  in  the  1960's.   With  the  help  of 
hindsight,  we  see  that  among  the  more  notable  from 
the  point  of  view  of  testing  mathematical  formulas 
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for  solving  ODEs,  were  the  extrapolation  method 

1  by  Cragg  (and  extended  by  Bulirsch  and 
Stoer)  and  the  Runge-Kutta  formula  pairs  developed 
by  Fehlberg  and  others. 

The  experimental  testing  of  this  period  was 
relatively  limited,  and  was  restricted  mostly  to 
comparing  formulas  of  fixed  order,  using  fixed 
stepsizes,  on  relatively  small  numbers  of  problems. 
For  example,  some  attempts  were  made,  independently 
by  R.ilston  and  Johnston,  to  determine  which  were 
the  better  Runge-Kutta  formulas.   Of  the  testing  I 

associated  with  during  this  period,  I  believe 
that  the  results  obtained  with  Creemer  and  publish- 
ed in  1963  were  the  most  significant.   After  several 
years  of  trying  to  determine  the  best  combination 
of  accuracy  and  stability  in  multi-step  formulas, 
we  finally  obtained  experimental  evidence  that  con- 
vinced us  we  should  use  Adams  formulas,  and  worry 
about  order,  and  not  about  more  accurate  formulas 
of  a  particular  order.   We  also  concluded  that  one 
corrector  per  step  was  optimal,  but  that  this 
corrector  needed  to  be  followed  by  an  evaluation 
(PECE)  for  stability.   We  didn't  foresee  the 
development  of  variable  order,  variable  step  Adams 
codes  which  are  so  common  nowadays,  but  I  wish  we 
had! 

Modern  stage 

During  the  1970's  there  has  been  a  rapid 
development  of  techniques  for  handling  stiff  systems, 
including  theoretical  criteria  for  testing  the 
stability  of  various  formulas.   Theoretical  criteria 
were  also  developed  for  non-stiff  methods.   For 
example,  criteria  for  variable  order,  variable  step 
methods  were  developed  at  the  University  of 
Illinois.   More  recently  a  theoretical  criterion 
for  testing  Runge-Kutta  formula  pairs  was  developed 
at  the  University  of  Toronto. 

Led  by  the  publication  of  Gear's  DIFSUB,  sub- 
stantial codes  began  to  be  widely  available  during 
this  period.   Some  of  the  better  known  ones  in 
North  America  are  associated,  for  example  with  the 
names  of  Bulirsch  and  Stoer,  Hindmarsh  and  Byrne, 
and  Shampine  and  Watts.   But  others  have  been 
developed  both  on  this  continent  and  in  Europe. 
(For  the  purposes  of  this  talk,  Britain  is  consi- 
dered to  be  a  part  of  Europe.) 

Experimental  testing  of  such  programs  also 
began  to  be  carried  out  on  a  relatively  large 
scale.   An  early  series  of  tests  was  carried  out 
at  the  Bell  Telephone  Laboratories.   Even  more 
extensive  testing  has  been  done  at  the  Sandia 
Corporation,  at  the  University  of  Manchester,  and 
at  the  University  of  Toronto.   The  emphasis  in  all 
of  this  testing  has  been  primarily  concerned  with 
reliability  and  efficiency,  in  one  form  or  another, 
of  the  various  methods. 

The  Toronto  testing  program  is  called  DETEST. 
It  has  changed  considerably  over  the  years,  but 
basically  it  collects  statistics  on  reliability  and 
efficiency  over  several  classes  of  problems.  Care 
was  taken,  in  all  the  ways  we  thought  were  possible, 
to  produce  results  that  were  not  affected  appre- 
ciably by  relatively  separate  issues,  for  example 
by  issues  such  as  determining  the  starting  stepsize. 

The  original  design  was  aimed  specifically  at 


comparing  methods,  as  opposed  to  simply  making  In- 
dependent assessments  of  them.   This  caused  us  to 
Insist  that  each  method  try  to  accomplish  the  same 
task,  namely  to  keep  the  error  per  unit  step  below 
the  prescribed  tolerance,  and  this  led  us  to  modify 
the  programs  accordingly.   This  caused  a  great  deal 
of  controversy.   Moreover,  an  "error  per  step"  rule 
is  more  efficient  and,  in  addition,  makes  it 
possible  to  step  over  the  most  common  kinds  of 
discontinuities.   The  present  version  of  DETEST 
therefore  does  not  require  any  modification  of  the 
program  being  tested.   It  is  aimed  more  at  the 
assessment  of  methods,  rather  than  their  comparison. 
However,  a  new  notion  of  "normalized  statistics" 
has  been  introduced  to  provide  an  indirect  method 
of  comparison.   The  present  version  is  also  much 
more  flexible.   For  example,  it  is  easy  to  select 
what  problems  are  to  be  used,  or  to  add  new  ones, 
and  global  statistics  can  be  obtained,  as  well  as 
local  ones.   NEW  DETEST  also  contains  a  class  of 
problems  with  discontinuities. 

Out  of  all  the  testing  that  has  been  done  on 
non-stiff  problems,  a  number  of  general  conclusions 
have  become  established.   For  example,  one  happy 
conclusion  is  that  there  do  not  seem  to  be  any 
serious  "class  distinctions",  by  which  I  mean  that 
methods  that  are  relatively  good  for  one  class  of 
problems  seem  to  be  relatively  good  for  the  other 
classes  as  well.   A  second  conclusion,  which  is  to 
be  expected,  is  that  variable  order  methods  are 
needed  to  handle  a  wide  range  of  tolerances.   A 
third  and  most  interesting  conclusion  is  that  func- 
tion evaluations  need  to  be  relatively  expensive  to 
justify  the  extra  overhead  that  multi-step  methods 
require  in  order  to  keep  down  the  number  of  func- 
tion evaluations.   This  last  point  was  the  key  one 
in  comparing  extrapolation  methods  with  Adams 
methods  according  to  our  earlier  tests,  while  the 
Runge-Kutta  methods  were  occasionally  unreliable. 
However,  the  newer  Runge-Kutta  methods  based  on 
formula  pairs  appear  to  have  displaced  the  extra- 
polation methods  as  the  best  ones  to  use  when  func- 
tion evaluations  are  relatively  inexpensive. 

The  corresponding  situation  with  the  results 
of  STIFF  DETEST  is  more  complicated,  and  the  con- 
clusions are  not  nearly  so  definite.   For  one  thing, 
the  Jacobian  plays  an  important  role,  and  an  impor- 
tant factor  in  assessing  different  methods  is  the 
dimension  of  the  problem,  since  two  methods  may 
differ  in  the  relative  number  of  nJ  and  n  opera- 
tions they  perform.   Also  it  may  matter  whether  the 
Jacobian  is  determined  numerically,  whether  it  is 
sparse  or  not,  and  so  on.   Other  important  factors 
appear  to  be  the  nearness  of  the  eigenvalues  to  the 
imaginary  axis,  the  degree  of  non-linearity,  the 
kind  of  coupling  between  the  smooth  and  non-smooth 
components  of  the  solution,  and  the  tolerance.  Also, 
methods  are  more  likely  to  fail  on  stiff  problems, 
and  this  makes  the  collecting  of  meaningful  statis- 
tics more  difficult. 

Nevertheless,  it  does  appear  that  programs 
based  on  backward  differentiation  formulas  stand  up 
very  well,  their  main  weakness  being  when  the 
eigenvalues  are  near  the  imaginary  axis.   But  there 
are  other  good  methods  as  well,  and  much  more  needs 
to  be  learned  about  them.   For  example,  there  are 
good  methods  based  on  second  derivative  formulas, 
implicit  Runge-Kutta  formulas,  extrapolation,  and 
block  implicit  techniques. 
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The  future 

Of  course  it  is  easy  to  see  that  in  the  imme- 
diate future  we  can  expect  further  testing  in  terms 
of  reliability  and  efficiency  of  more  programs  over 
broader  classes  of  problems.   But  I  would  like  to 
make  some  brief  comments  on  a  number  of  specific 
areas  in  which  I  consider  it  would  be  especially 
worthwhile  trying  to  improve  on  what  we  have  been 
doing.   One  common  feature  in  each  area  is,  I 
believe,  an  attempt  to  focus  attention  on  those 
aspects  of  testing  that  reflect  the  real  needs  of 
the  ultimate  user  of  ODE  software,  at  least  insofar 
as  it  is  realistic  to  do  so. 

As  a  first  point,  consider  the  interpretation 
of  the  tolerance.   As  a  consequence  of  earlier 
arguments  about  error-per-step  and  error-per-unit- 
step,  a  number  of  us  have  become  convinced  that  a 
very  good  way  to  interpret  the  parameter  TOL  is  in 
terms  of  proportionality.   The  design  goal  for  the 
ODE  program  is  then  to  produce  answers  whose  global 
error  has  a  norm  that  is  proportional  to  TOL.   (This 
is  intended  to  be  quite  independent  of  whether  the 
error  is  relative  or  absolute,  or  of  whether  com- 
ponents of  the  error  vector  are  weighted  in  some 
special  way.)   The  factor  of  proportionality  is  of 
course  problem  dependent,  but  this  interpretation 
is  easily  understood  by  a  user,  and  also  easily 
exploited  by  him.   The  current  version  of  DETEST 
computes  a  measure  of  how  smoothly  the  error  depends 
on  TOL,  but  improved  measures  need  to  be  considered. 
It  should  be  noted  that  it  is  often  particularly 
difficult  to  meet  this  design  goal  for  large  TOL. 
It  should  also  be  noted  that  the  frequency  with 
which  output  is  requested  could  have  a  considerable 
effect  on  how  well  some  programs  meet  this  goal. 

A  second  point  that  I  would  like  to  mention 
concerns  the  scale  of  a  problem.   I  have  often 
thought  that  it  was  not  unreasonable  to  expect  the 
user  to  provide  some  measure  of   the  scale  of  his 
problem,  as  part  of  the  problem  specification. 
Requiring  HMAX  is  a  possibility,  but  HMAX  has  the 
disadvantage  that  it  depends  on  the  method  as  well 
as  the  problem,  and  the  user  should  only  be 
required  to  provide  information  about  his  problem. 
Many  users  would  object  to  any  such  requirement 
(even  if  we  had  a  good  idea  of  what  we  meant  by 
it!),  but  I  would  at  least  like  to  have  the  option 
of  providing  a  value  of  SCALE,  if  only  as  a 
"knob"  that  can  be  turned  one  way  to  make  the 
method  more  reliable  (and  the  other  way  to  make  it 
more  efficient  of  course) .   In  our  earlier  tests 
of  Runge-Kutta  methods  we  found  some  of  them  to 
be  quite  unreliable,  until  we  imposed  a  maximum  on 
the  stepsize,  in  which  case  the  improvement  was 
dramatic.   The  test  results  can  therefore  depend 
in  a  crucial  way  on  how  this  question  is  handled. 
Of  course  we  can  simply  use  default  values,  or 
build  heuristics  into  our  programs  to  estimate  the 
scale  of  a  problem.   In  any  event,  I  believe  that 
more  attention  should  be  given  to  this  question, 
fit.  should  of  course  be  acknowledged  that  this 
question  is  made  more  complicated  by  the  fact  that 
the  scale  may  change  during  the  course  of  the 
integration. ) 

What  other  measures  of  reliability  or  robust- 
ness should  we  test  for?  Can  we  test  a  program  for 
Its  ability  to  detect  stiffness,  or  the  presence  of 

i  i  ■■'  or.r  Iri'il  tJMB,    DX  trouble  with  roundoff?   Can  wo 


test  separately  its  ability  to  get  started?  Or  to 
cope  with  discontinuities?  What  about  improper 
use?    (A  common  example  of  the  latter  is  undefined 
input  variables,  but  our  programming  languages  seem 
to  make  it  virtually  impossible  to  deal  with  this 
problem  in  a  decent  way.) 

With  regard  to  efficiency,  it  seems  to  me  that 
more  thought  should  be  given  to  standardizing  the 
way  in  which  we  present  the  results.   For  example, 
with  non-stiff  methods  for  a  particular  class  of 
problems,  I  wonder  if  we  could  present  the  results 
in  terms  of  a  small  number  of  parameters.   One 
possibility  might  be  the  parameters  h,  f,  t  and  k, 
where  the  computing  cost  has  been  determined  to  be 


h  +  f*F  +  t*TOL 


Thus  h  is  a  kind  of  overhead  cost,  f  is  the  coeffi- 
cient related  to  the  cost  F  of  evaluating  functions, 
while  t  and  k  appear  only  in  the  contribution  due 
to  TOL.   It  would  be  particularly  helpful  if  some 
such  result  could  be  normalized  in  such  a  way  that 
it  was  relatively  independent  of  the  problem  class. 
Of  course  it  would  also  be  helpful  if  the  normali- 
zation could  factor  out  the  dependence  on  a  parti- 
cular machine,  and,  as  well,  the  dependence  on 
implementation  details  such  as  how  the  compiler 
handles  subscripts.   Other  components  of  cost,  such 
as  storage,  and  other  properties  of  the  method, 
such  as  how  well  it  gets  started,  or  how  well  it 
copes  with  discontinuities,  would  also  have  to  be 
considered,  but  it  would  be  very  helpful  if  these 
could  also  be  presented  in  some  sort  of  normalized 
way. 

The  same  sort  of  objective  wuld  of  course  be 
desirable  in  the  case  of  stiff  systems  as  well,  but 
the  situation  would  be  much  more  complicated  here. 
Nevertheless,  we  are  already  measuring  the  number 
of  times  an  nJ  process  is  invoked,  the  number  of  n 
processes,  and  so  on,  so  we  have  some  of  the  ingre- 
dients . 

I  continue  to  believe  that  theorems  about  pro- 
grams are  important.   I  have  in  mind,  for  example, 
proving  that  a  particular  method  solves  all  pro- 
blems in  a  particular  class,  in  the  sense  that  it 
produces  z(x)  such  that  ||z' (x)-f  (x,  z)  ||  <  TOL.   I 
realize  that  such  theorems  can  be  established  only 
over  very  limited  classes,  and  are  really  not  very 
practical.   However,  I  continue  to  believe  that 
establishing  of  such  theorems  is  a  good  necessary 
condition  to  be  satisfied  and,  moreover,  that  the 
weakness  of  the  conditions  under  which  such  a 
theorem  can  be  established  is  a  measure  of  the 
strength  of  a  method. 

With  regard  to  broadening  the  class  of  problems 
over  which  methods  are  tested,  I  have  a  number  of 
points  in  mind.   One  has  been  impressed  on  me 
because  none  of  our  problems  ever  uncovered  the 
inability  of  Fehlberg  pairs  of  order  more  than  4 
to  solve  simple  quadrature  problems  of  the  form 
y1  =  f(x).   A  second  point  is  to  have  more  problems 
in  which  stiffness  changes.   A  third  is  to  develop 
problems  for  testing  the  sensitivity  of  methods  to 
changing  parameters.  (I  would  like  to  ask  what  we 
should  do  about  the  situation  that  often  arises 
in  chemical  kinetics  when,  due  to  a  minor  numerical 
error,  it  is  possible  for  one  of  the  concentrations 
in  Ih-i  t  ,ini  ■  i  k  i  ■ .  1 1  i  v  i  •  ami  throw  the  calculation  ofi 
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I  would  like  to  conclude  with  a  reminder  that 
good  documentation  is  needed  and,  if  we  are  testing 
-,  we  should  include  some  sort  of  assessment 
t  documentation.   Of  course  I  don't  mean  to 
suggest  that  this  can  be  quantified,  but  some  kind 
of  assessment  would  be  helpful.   Perhaps  this  would 
also  involve  a  statement  about  how  convenient  the 
program  is  to  use,  about  what  options  are  available 
(that  have  not  already  been  considered) .   One 
particularly  difficult  option  to  deal  with  is  the 
incorporation  of  stopping  criteria  other  than  the 
standard  one  of  stopping  at  a  particular  value  of 
the  independent  variable.   Another  property  that 
must  not  be  forgotten  is  the  extent  to  which  any 
special  knowledge  that  the  user  might  have  can  be 
exploited,  regarding  the  stiffness  of  the  problem 
for  example,  or  some  measure  of  the  scale,  or 
whether  the  problem  is  linear  or  not,  or  whether 
the  user  can  provide  the  Jacobian,  and  so  on. 

Altogether,  it  appears  that  our  ODE  solvers 
might  be  tested  for  many  different  things,  and  some 
of  those  properties  that  are  already  being  tested 
might  be  done  in  ways  that  are  at  least  somewhat 
more  carefully  defined  and  useful. 
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Abstract 

The  latest  version  of  the  NAG  library  includes 
a  suite  of  subroutines  based  on  the  Runge-Kutta- 
Merson  formula.   This  suite  includes  a  main  routine 
with  a  variety  of  features  and  a  number  of  driver 
routines  designed,  in  the  main,  for  "black-box" 
use.   These  routines  and  a  number  of  auxiliary 
ones  are  described  in  detail. 

1 .    Introduction 

The  NAG  library  ordinary  differential  equation 
chapter  contains  a  set  of  subroutines . based  on  the 
Runge-Kutta-Merson  formula  for  integrating  the 
initial  value  problem 


f(x,  y),  y(X)   given 


(1.1) 


over  a  range   [X,  XEND,.   At  the  lowest  level 
there  is  a  subroutine  D02YAF  which  is  designed  to 
take  one  step  of  predetermined  length  using  the 
Merson  formula.   This  will  be  described  in  more 
detail  in  the  next  section.   At  the  next  level  is 
a  general  subroutine  D02PAF  which  is  designed  to 
integrate  across  a  range  by  calling  D02YAF  at 
each  step.   D02PAF  also  has  a  number  of  other 
features  such  as  interrupt  and  interpolation 
facilities  and  these  will  be  described  in  detail 
in  section  3.   At  a  higher  level  still  there  are 
several  driver  routines  for  D02PAF.   In  the  main, 
these  routines  have  been  designed  so  that  the 
inexperienced  user  can  solve  his  problems  without 
recourse  to  D02PAF.   However,  one  driver  routine, 
D02BDF,  is  designed  to  calculate  global  error 
estimates  and  to  perform  stiffness  checks,  and 
this  is  intended  for  general  use.   All  these 
routines  are  discussed  in  section  A. 

Specifications  of  all  the  subroutines 
discussed  here  can  be  found  in  the  NAG  Manual, 
Mark  7  (1979). 


calculation  of  the  error  estimate  was  significant, 
componentwise.    In  designing  this  significance 
test  we  observe  that  the  error  estimate  is 
calculated  by  componentwise  subtraction  of  two 
approximate  solutions  at  X  +  H   (as  is  almost 
always  the  case  in  Runge-Kutta  routines).   We  flag 
insignificance  if,  in  this  subtraction,  all  the 
significant  digits  of  the  two  solutions  cancel  to 
rounding  error  level.   Since  the  step   H  is 
determined  (in  routine  D02PAF)  from  the  error 
estimate  we  feel  that  it  is  important  to  be  aware 
if  this  determining  factor  is  completely  dependent 
on  the  wordlength  characteristics  of  the  computer 
in  question.   In  fact,  the  workspace  array  has  been 
so  arranged  that  all  the  workspace  used  on  the 
intermediate  steps  in  the  Runge-Kutta  step  is  used 
again  in  returning  the  information  about  the 
solution  and  its  derivatives,  and  the  error  estimate 
and  its  significance. 

There  are  two  other  modes  of  ral lino  rmtrinp 
D02YAF.   Both  are  designed  for  use  on  calls  from 
routine  D02BDF.   This  routine  computes  global  error 
estimates  for  solutions  of  routine  D02PAF  and 
attempts  to  determine  the  degree  of  "stiffness"  of 
the  system  (1.1).   In  neither  of  these  calls  is 
there  any  need  to  retain  information  for  inter- 
polation purposes  and  immediate  storage  savings  are 
made.   Also,  when  calculating  global  error  estimates 
the  stensize  is  controlled  by  independent  calls  to 
D02PAF  so  that  the  local  error  estimate  is  not 
needed,  leading  to  savings  in  computer  time  and 
storage.   When  calculating  the  degree  of  stiffness, 
one  of  the  algorithms  used  is  essentially  that  of 
Shampine  (1977)  which  requires  a  local  error 
estimate  for  Euler's  method  with  step  H   for  use 
in  a  comparison  with  the  Runge-Kutta-Merson  error 
estimate.   This  is  returned  in  a  column  of  the 
workspace  used  on  a  normal  call  for  returning 
interpolation  information. 


2. 


Subroutine  D02PAF 


In  its  normal  mode  of  operation  routine 
D02YAF  takes  a  step  of  length   H  from  a  current 
point  X  at  which  point  the  solution  is  stored  in 
the  array  Y.   It  produces  the  solution  at   X  +  H 
in  Y  and  it  reorganises  columns  of  the  workspace 
array  to  retain  values  of  y'   at   X  +  H  and  y 
and  y'   at   X   for  use  by  the  interpolation 
routines  which  are  to  be  discussed  in  section  3 
below.   The  routine  D02YAF  also  returns  a  local 

the  workspace  and  in 
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3.    Subroutine  D02PAF  and  auxiliary  interpolation 
routines 

Routine  D02PAF  is  designed  to  integrate  the 
system  (1.1)  from  an  initial  point   X   to  a  final 
point  XEND  using  a  sequence  of  calls  to  routine 
D02YAF.   The  stepsizes  in  the  calls  to  D02YAF  are 
designed  to  control  the  local  error,  with  a  special 
short  last  step  to  reach  XEND  exactly.   In  many 
respects  D02PAF  is  similar  to  well-known  routines 
RKFAr)  (Shampine  and  Watts,  1976a)  and  DVERK  (Hull 
et  al.,  1976).   We  will  only  report  on  the  major 
points  of  di  i  i  el  ence  between  D02PAF  and  these  rout  Lnes. 
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Subroutine  D02PAF  has  the  calling  sequence 

SUBROUTINE  D02?M\\,   SEND,  N,  Y,  CIN,    .  FCN, 
I  COMM,  CONST.  QOUT,  W,  IW,  1W1  ,  [FAIL) 

GES  N,  IV,   IW1,  IFAIL 
REA        .  YtH),  CIN(b),  TOL,  COMM(5),  CONST(3), 

i  cout(U),  w(iw,  ivn 

RSAL  FCN 

The  parameters   X  and  XEND  are  discussed  above, 
Y  is  used  to  pass  the  computed  solution  to  and 
from  the  routine,  TOL  is  used  for  local  error 

vol,  FCN  is  the  usual  routine  used  to  define 
the  differential  system,  the  array  W  is  used 
mainly  for  workspace,  and  IFAIL  is  an  error 
indicator  to  be  discussed  below. 

There  are  four  arrays  CIN,  COMM,  CONST,  COUT 
used  for  communication  with  routine  D02PAF,  and  in 

tain  circumstances  columns  of  the  workspace 
array  W  are  also  used.   In  general  terms,  the 
array  CIN  is  used  to  communicate  information  to 
control  the  mode  of  entry  to  the  routine,  to 
determine  the  type  of  error  control,  and  to  supply 
initial,  maximum,  minimum  and  continuation  step- 
sizes  when  needed.   The  array  COMM  is  used  to 
handle  a  variety  of  facilities  designed  to  inter- 
rupt the  integration  in  different  ways.   If  the 
integration  is  controlled  largely  through  the  use 
of  interrupts  with  no  intention  to  integrate  to 
XEND,  then  the  value  XEND  is  essentially  used  to 
define  a  "scale"  for  the  integration.   Both 
CIN  and  COMM  are  described  in  more  detail  below. 
The  array  CONST  can  be  used  to  pass  control 
constants  to  the  routine  to  assist  in  the  internal 
stepsize  strategy.   Two  of  the  elements  of  CONST 
can  be  used  to  vary  the  maximum  and  minimum  ratios 
in  attempted  stepsizes  on  successive  steps.   (Of 
course  the  stepsize  can  be  reduced  as  much  as 
necessary  at  any  step  by  trying  a  sequence  of 
stepsizes  of  successively  smaller  magnitude.)   The 
third  element  of  CONST  can  be  used  to  indicate  if 
the  differential  system  is  linear  and  then  a 
stepsize  strategy  is  adopted  to  exploit  the 
special  properties  of  the  Runge-Kutta-Merson 
formula  for  linear  problems.   The  array  COUT  is 
used  to  return  information  to  the  user.   Some  of 
this  information  is  statistical,  for  example 
maximum  and  minimum  stepsizes  used  so  far  in  the 
integration,  but  other  elements  of  COUT  are  used 
to  assist  in  interpolating  the  solution  within 
integration  steps,  and  yet  others  carry  infomation 
between  interrupts.   This  eliminates  the  need  for 
use  of  COMMON  but  places  the  user  on  trust  not  to 
change  elements  of  COUT  at  interrupts.   The 
elimination  of  COMMON  is  important  in  a  library 
environment  as  it  permits  integrating  separate 
systems  of  differential  equations  simultaneously 
using  the  interrupt  facility.   The  use  of  the 
four  communication  arrays  (five  including  the 
workspace  W)  rather  than  just  one,  as  in  DVERK, 
is  designed  to  permit  future  development  in  a 
library  environment  without  major  reprogramming 
effort. 

We  now  discuss  the  use  of  the  control  array 
CIN.   The  element  CIN(l)  is  used  to  control  the 
mode  of  integration.   On  initial  entry,  CIN(l) 
must  be  set  to  zero  or  one.   If  it  is  set  to  zero, 
default  values  of  the  other  elements  of  CIN,  and 
the  elements  of  COMM  and  CONST  are  used.   The 
routine  D02PAF  is  designed  to  treat  this  choice 
as  a  special  case  and  a  fast  integration  from  X  to 


resul t  s .   ii  ciN  (l )  is  set  I      all  the 
other  elements  of  CIN  and  the  eli        COMM  and 
CONST  must  be  set  (the  choice  zei  aull 

values  in  all  cases).  On  return,  by  reaching  XKND 
or  by  interrupt,  a  value  of  CIN(l)  between  two  and 
six  is  obtained  (depending  on  the  mode  of  exit  ), 
and  the  routine  can  be  re-entered  with  this  value. 
The  choice  CIN(2)  =  0.0  (the  default)  gives  mi 

itive/absolute  error  control  on  the  maximum  norm 
of  the  local  error.   Similarly  CIN(2)  =  1.0  and  2.0 
give  absolute  and  relative  (with  floor)  local  error 
tests  respectively,  and  the  choices  CIN(2)  =  3.0  and 
4.0  give  componentwise  mixed  and  absolute  local 
error  tests  respectively  (a  relative  test  is  a 
special  case  of  the  mixed  test  in  this  case).   In 
these  latter  cases,  columns  of  the  workspace  array 
are  used  to  pass  componentwise  error  cont 
information.   The  element  CIN(3)  can  be  used  to 
specify  a  minimum  stepsize.   However,  a  minimum 
stepsize  designed  to  ensure  progress  across  the 
range  of  integration  is  calculated  and  used  if  it  is 
larger  than  the  user's  choice.   (The  minimum  step- 
size  is  actually  used  only  to  activate  error  exits.) 
The  element  CIN(4)  can  be  used  to  pass  a  maximum 
stepsize,  the  default  being  0.5(XEND  -  X).   The 
element  CIN(5)  can  pass  an  initial  stepsize  estimate 
(and  returns  the  initial  stepsize  used).   If  the 
estimate  is  smaller  than  the  minimum  or  larger  than 
the  maximum  then  the  corresponding  minimum  or 
maximum  is  used.   If  a  default  value  is  requested, 
then  an  estimate  approximately  proportional  to 
(HyI|/||?  II)*(XEND  "  X)  *  (MACHEPS  *  T0L)p'2  is  used, 
where  p   is  the  order  of  the  method.   Safeguards 
are  taken  against  zero  values  arising  in  the  norms. 
This  estimate  usually  gives  a  step  rather  smaller 
than  the  largest  stepsize   allowed  by  the 
local  error  control.   This  is  deliberate  as  an 
iteration  is  employed  to  compute  a  realistic  initial 
step  and  care  must  be  taken  to  avoid  nvprflnw  in 
this  iteration.   The  element  CIN(6)  is  used  only  for 
output  of  the  next  stepsize  in  any  exit  from  D02PAF. 
This  stepsize  is  used  on  re-entry. 

The  array  COMM  is  used  to  enable  interrupt 
control  of  D02PAF.   If  C0MM(1)  >  0.0,  it  is  used  as 
an  upper  bound  on  the  maximum  norm  of  the  solution 
as  the  integration  proceeds.   The  integration  is 
interrupted  if  this  bound  is  exceeded  (this 
facility  is  designed  to  enable  users  to  avoid  over- 
flows).  If  C0MM(2)  >  0.0  it  is  used  as  an  upper 
bound  on  the  number  of  permitted  calls  to  routine 
FCN,  in  the  usual  way.   If  C0MM(3)  >  0.0,  an  inter- 
rupt is  made  on  a  change  of  sign  of  Y(I)  -  Z(I), 
I  =  1,  2,  ...,  N  where  the  user's  chosen  values 
Z(I)   are  passed  in  a  column  of  workspace  (this 
facility  is  useful  for  root-finding).   If 
COMM (A)  <  0.0,  an  interrupt  is  made  when  X  -  C0MM(5) 
changes  sign  (this  simplifies  output  at  user- 
selected  points).   If  COMM(A)  >  0.0,  an  interrupt  is 
made  after  every  step  in  the  manner  of  many  other 
solvers.   All  the  other  interrupts  can  clearly  be 
accommodated  by  using  COMM(A)  at  the  expense  of 
repeated  calls  to  D02PAF,  but  we  feel  this  is 
unjustified.   Because  of  the  interpolation  process 
to  be  described  below  two  steps  are  taken  before 
permitting  an  interrupt,  on  the  assumption  that  the 
user  will  wish  to  interpolate.   Hence  if  an  inter- 
rupt of  the  C0MM(1)  >  0.0  or  C0MM(3)  >  0.0   type  is 
required  on  the  first  accepted  step  we  adjust  the 
stepsize  so  that  the  interrupt  comes  on  the  second 
step  instead. 
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As  has  been  previously  stated,  D02PAF  has  been 
designed  to  permit  interpolation  to  the  approximate 
solution  within  integration  steps.   This  interpol- 
ation process  is  designed  to  use  the  minimum  amount 
of  retained  information  whilst  being  of  at  least 
the  same  order  of  accuracy  as  the  integration.   To 
minimise  the  amount  of  retained  information  we  use 
Hermite  interpolation  on  three  points  giving  an 
accuracy  of  0(h)   as  against  an  accuracy  of  0(h) 
in  the  integration.   This  choice  clearly  entails 
using  solution  and  derivative  information  at  X  +  H, 
X  and  X  -  H'   where  H'   is  the  stepsize  previous 
to  the  current  one.   Clearly  we  could  save  one 
column  of  workspace  storage  by  not  retaining, 
say,  the  derivative  at   X  -  H'.   However,  this 
would  mean  losing  symmetry  in  the  interpoaltion 
process  and,  in  any  case,  the  author  feels  that  it 
is  preferable  that  the  integration  error  dominate 
the  interpolation  error  so  that  by  varying  the 
local  error  tolerance  TOL  one  varies  the  accuracy 
at  all  points  approximately  equally.   We  could 
instead  have  retained  solution  values  at  several 
earlier  points  but  this  would  complicate  the  code 
far  more  than  our  choice.   Note  that  the  retained 
information  is  used  passively,  that  is  for  inter- 
polation purposes  only,  and  hence  should  have  no 
effect  on  the  stepsize  strategy.   This  ensures 
that  the  algorithm  remains  one-step  in  character. 
However,  it  is  insufficient  just  to  consider  the 
order  of  the  interpolation  error  -  its  actual  size 
is  important.   The  interpolation  error  at  a  point 
t   is  approximately  proportional  to 

p(t)  =  (t  -  X  -  H)2(t-X)2(t-X+H')2.   (3.1) 


If  we  consider 


max      |p(t) 
te[X-H',X+H] 


function  of   r  =  H/H'   then  its  minimum  is  attained 
when  r=l   and  it  increases  monotonically  and 
quickly  as   r   increases  taking  the  value  5  near 
r  =  2   and  values  of  over  100  for  r  <  5.   The 
maximum  value  of  ||p||  is  attained  on  the  longer 
of  the  subintervals   [X-H*,x]   and   [X,  X+H]. 
We  assume  that  interpolation  is  generally  only 
required  on  the  step   [X,  X  +  H]   and  therefore  the 
large  values  of   ||p||  imply  that  we  should  restrict 
stepsize  increases  to  bound  the  increase  in  inter- 
polation error  from  one  step  to  the  next.   We 
choose  an  upper  bound  of   r  =  2  on  this  evidence. 
Two  subroutines  D02XAF  and  D02XBF  are  provided  to 
perform  the  interpolation  using  information 
retained  in  Y,  W  and  COUT  on  exit  from  D02PAF. 
Routine  D02XAF  computes  an  approximate  solution 
at  any  point  in   [X-H',  X  +  H],  whilst  D02XBF 
computes  a  selected  component  of  the  solution  in 
the  same  range.  It  was  felt  that  to  supply 
routines  for  these  two  extreme  cases  was  justi- 
fiable but  that  any  more  complicated  selection 
of  components  might  be  more  expensive  to  calculate 
than  repeated  calls  to  D02XBF. 

It  is  widely  accepted  that  the  error  in 
integrating  the  system  (1.1)  should  be  approx- 
imately proportional  to  TOL.   To  ensure  this  is 
the  case,  routine  D02PAF,  in  common  with  most 
other  modern  subroutines,  is  designed  to  vary  the 
stepsize  to  make  the  local  error  vary 
proportionally  and  smoothly  with  TOL.   One  asepct 
of  this  strategy  is  to  avoid  choosing  stepsizes  on 
the  evidence  of  insignificant  error  estimates  (see 
section  2).   These  insignificant  estimates  can 


arise  in  two  ways.   During  an  integration,  the  error 
estimate  is  unlikely  to  become  insignificant  unless 
TOL  is  so  small  that  the   stepsize  becomes  very 
small.   If  this  occurs  an  error  exit  is  taken  (other 
subroutines  have  error  exits  for  the  related  case 
that  too  much  accuracy  is  requested).   Alternatively 
an  insignificant  error  estimate  can  be  returned  by 
D02YAF  during  the  initial  stepsize  iteration.   This 
is  just  taken  as  a  sign  that  the  current  stepsize  is 
too  small  and  the  stepsize  is  increased  and  the 
iteration  proceeds.   An  indication  of  the  dependence 
of  the  local  error  on  TOL  is  given  through  two 
elements  of  the  array  COUT.   A  count  is  kept  of  the 
total  number  of  successful  steps  and  the  number 
using  the  maximum  stepsize.   If  the  second  count  is 
large  relative  to  the  first  then  one  might  infer 
that  this  choice  of  TOL  has  not  controlled  the  local 
error. 

4.    Driver  Subroutines 

There  are  five  driver  subroutines  for  D02PAF. 
The  first  and  simplest,  D02BAF,  integrates  the 
system  (1.1)  across  the  range   [X,  XEND],   It  is 
intended  for  very  inexperienced  users  and  hence  has 
a  particularly  simple  calling  sequence  designed  to 
define  the  problem,  input  a  local  error  tolerance 
TOL,  designate  some  workspace  and  permit  error 
exits.   The  second  routine,  D02BBF,  is  similar  to 
D02BAF  but  implements  slightly  more  general  error 
control  (similar  to  the  use  of  CIN(2)  =  0.0,  1.0  or 
2.0  in  D02PAF),  and  permits  output  at  user-selected 
points.   This  output  is  implemented  through  a  user- 
supplied  subroutine  OUT.   This  routine  is  called  at 
the  initial  point  X  by  D02BBF  and  a  parameter  of 
OUT  must  be  changed  to  the  next  output  point  and  so 
on.   Hence,  when  OUT  is  called  at  the  current  output 
point,  it  must  redefine  the  output  point  in  addition 
to  performing  its  printing  task.   Internally  D02BBF 
uses  the  interrupt  C0MM(4)  <  0.0  to  pass  output 
points  to  D02PAF. 

There  are  also  two  driver  routines  designed  to 
solve  equations  posed  in  terms  of  the  solution)  as 
the  integration  proceeds.   The  simpler  of  these, 
D02BGF,  solves  an  equation   yf(x)  =  v   for  x   and 
exploits  the  interrupt  C0MM(3)  >  0.0  to  locate  the 
position  of  x  approximately.   The  second,  D02BHF, 
solves  a  general  equation  g(x,  y)  =  0  by  locating 
a  change  in  sign  of  g  using  the  interrupt 
C0MM(4)  >  0.0  (interrupt  at  every  step).   In  both 
cases  the  equations  are  solved  to  machine  precision 
using  the  interpolants  defined  by  D02XBF  and  D02XAF 
respectively  and  using  a  secant/bisection  rootfinder 
C05AZF  based  on  an  algorithm  by  Bus  and  Dekker  (1975) 
The  equations  are  solved  to  machine  accuracy  to  put 
all  the  emphasis  on  the  accuracy  of  the  integration, 
so  that  by  varying  the  local  error  tolerance  TOL  we 
can  simply  control  the  accuracy  of  the  solution  of 
the  equations. 

The  last  driver  routine,  D02BDF,  is  designed  to 
solve  (1.1)  whilst  producing  a  global  error  estimate 
and  a  variety  of  related  statistics,  and  to  give  an 
indication  whether  the  system  (1.1)  is  stiff.   The 
global  error  estimate  is  calculated  using  the  two- 
and-one  technique  discussed  in  Shampine  and  Watts 
(1976b).   The  actual  solution  output  should  be 
close  to  that  which  would  be  calculated  by  a  call 
to  routine  D02PAF  with  the  same  value  of  TOL  (though 
it  is  calculated  by  calls  to  D02YAF  only).   The 
calculation  proceeds  by  interrupting  D02PAF  at  each 
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and  taking  two  hall  stops  with  DOJYAF  to  calc- 
ulate .in  auxiliary  solution,  the  difference  between 
the  two  solutions  giving  the  estimate.   The  other 
statistics  produced  include  the  maximum  error 
estimate  in  magnitude  in  the  integration  for  each 

t  .inJ  information  about  the  monotonic  growth 
of  the  error  estimate  .iiui  any  sign  changes  in  the 
estimate.   The  user  is  permitted  to  specify  a 
maximum  size  for  the  solution  and  this  can  be  used 
to  prevent  overflow  during  calculation  of  the 
various  solutions.   Overflow  is  more  likely  to 
occur  during  calculation  of  the  solution  using 
DC2YAF  as  there  is  no  error  control  on  this 
solut  i  on. 


SHAMPINE,  L.F.  &  WATTS,  H.A.  (1976a)  Practical 

Solution  of  Ordinary  Differential  Equations 
by  Runge-Kutta  Methods,  Rept .  SAND  76-0585, 
Sandia  Labs.,  Albuquerque,  New  Mexico. 

SHAMPINE,  L.F.  &  WATTS,  H.A.  (1976b)  Global  Error 
Estimation  for  Ordinary  Differential 
Equations,  TOMS,  2,  pp. 172-186. 


The  stiffness  check  is  based  on  two  separate 
calculations.   The  first  uses  the  ideas  of  Shampine 
(1977)  where  an  algorithm  is  described  for  a  Runge- 
Kutta-Fehlberg  method.     The  stiffness  check  is 
based  on  a  comparison  of  stepsizes  predicted  from 
the  Fehlberg  algorithm  and  Euler's  method.   The 
comparison  is  dependent  on  certain  properties  of 
the  absolute  stability  regions  of  the  two  methods. 
It  is  fortuitous  that  the  stability  region  of  the 
Merson  algorithm  used  in  D02PAF  are  as  well- 
suited  to  this  application  as  the  Fehlberg 
stability  region.   The  second  stability  check  is 
based  on  the  simple  observation  that  if  a  stiff 
problem  is  integrated  using  a  non-stiff  solver  such 
as  D02PAF,  the  stepsizes  used  are,  in  the  main, 
controlled  by  the  requirement  of  stability  rather 
than  by  the  local  error  tolerance  TOL .   Hence,  if 
separate  calls  to  D02PAF  are  made  with  local  error 
tolerances  TOL  and  2P  *  TOL,  twice  the  number  of 
steps  should  be  used  in  the  first  case  if  the  step- 
size  is  being  controlled  by  the  local  error 
tolerance.   We  compute  a  stiffness  check  by 
measuring  the  deviation  from  this  expected 
behaviour.   In  our  tests  we  have  found  these  two 
checks  approximately  equally  reliable  (but  the 
second  is  more  expensive  to  compute  and  hence  the 
user  may  opt  to  avoid  it).   In  fact  we  have  also 
tested  other  possible  stiffness  checks.   For 
example,  we  have  counted  the  number  of  unsuccessful 
steps  relative  to  the  number  of  successful  ones  for 
various  values  of  TOL.   One  can  formulate  theories 
about  the  behaviour  of  this  ratio  for  stiff  systems 
but  unfortunately  this  behaviour  is  not  restricted 
to  such   systems  and  the  ratio  does  not  provide  a 
reliable  measure. 
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The  aim  of  this  paper  is  to  present  methods  to  improve  the  robustness  of  a  general  purpose 
run-time  system  for  digital  simulation  of  continuous  systems. 

This  aspect  of  simulation  software  robustness  is  just  one  of  several  possible  aspects.  One 
could  as  well  discuss  the  robustness  of  a  language  to  formulate  simulation  problems  (simula- 
tion language)  or  the  robustness  of  a  compiler  for  such  a  language  (simulation  compiler). 
Such  aspects  will,  however,  not  be  considered  in  this  article. 

The  run-time  system  robustness  is  a  very  important  aspect  which  has  not  sufficiently  been 
taken  into  account  in  the  development  of  existing  simulation  software.  The  aim  of  this  paper 
is  to  specify  demands  rather  than  to  present  optimal  solutions.  Solutions  are  presented  from 
an  engineering  point  of  view.  They  improve  the  run-time  system  robustness  to  some  extent, 
but  better  solutions  can  certainly  be  found.  It  is  hoped  that  some  of  the  Numerical  Mathe- 
maticians attending  this  Conference  may  find'  these  problems  of  interest  and  may  develop 
better  solutions  to  them  in  the  future.  This  is  one  of  the  main  reasons  why  the  authors  de- 
cided to  present  this  material  at  a  Conference  of  Numerical  Mathematics  rather  than  at  a 
Conference  of  Simulation  Techniques. 


I)  INTRODUCTION: 


A  simulation  run-time  system  can  be  robust  in  two 
senses : 


produced  results.  For  this  reason,  it  is  vital  that 
each  algorithm  in  the  system  has  its  own  "bell" 
which  rings  as  soon  as  it  is  unable  to  properly 
proceed.  Under  no  circumstances  are  incorrect 
results  allowed  to  be  displayed  to  the  user. 


a)  The  user  should  never  be  required  to  provide  any 
kind  of  information  which  he  does  not  have  at  his 
disposal.  He  should  be  able  to  concentrate  on  those 
factors  which  have  to  <^o  with  the  statement  of  his 
problem,  and  should  be  relieved,  as  much  as  pos- 
sible, of  all  aspects  which  have  to  do  with  the  way 
his  problem  is  executed  on  the  machine.  He  should 
be  able  to  describe  his  system  as  easily  as  pos- 
sible in  terms  which  are  closely  related  to  his 
common  language,  but  must  not  be  required  to  pro- 
vide a  step-size  for  the  numerical  integration  or 
to  specify  the  integration  algorithm  to  be  used. 

b)  The  run-time  software  itself  must  be  able  to 
check  whether  the  produced  time  responses  are 
"correct"  (within  a  prescribed  tolerance  range). 
The  user,  normally,  has  a  more  or  less  precise 
(although  often  not  mathematically  formulated) 
knowledge  of  the  system  he  is  investigating.  He 
has,  however,  hardly  any  "insight  information"  into 
the  tool  he  is  using  for  that  task.  He  is,  usually, 
very  credulous  (the  obtained  results  must  be  cor- 
rect because  the  computer  displays  14  digits!),  and 
he   has   no  means   to  judge  the  correctness  oi  thi 


II)  AUTOMATED  SELECTION  OF  INTEGRATION 
ALGORITHMS: 


A  huge  step  towards  robust  simulation  software  has 
been  taken  in  the  development  of  step-size 
controlled  integration  algorithms.  Before  these 
algorithms  existed,  the  user  of  digital  simulation 
software  was  required  to  supply  information  concer- 
ning the  step-size  to  be  used  —  an  information 
item  which  he  clearly  did  not  have  at  his  disposal. 
Now,  the  user  can  simply  provide  a  tolerance  range 
for  the  accuracy  of  the  results.  This  is  identical 
to  requesting  the  user  to  identify  the  smallest 
number  in  his  problem  which  can  be  distinguished 
from  zero.  This  question  can  certainly  be  answered 
by  any  user,  independently  of  whether  he  is  an 
expert  in  Numerical  Mathematics  or  not,  since  it  is 
closely  related  to  the  physics  of  the  problem,  and 
not  to  the  numerical  behaviour  of  the  algorithm. 

Available  simulation  software,  up  to  now,  usually 
offers  a  comprehensive  selection  of  different  inte- 
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tion   algorithm*.  It  does,  however,  not  toll  the 

:  which  would  be  the  most  appropriate  one  for 
his  particular  application.  In  this  way,  the  user 

igaifl  confronted  with  making  a  decision  on  some- 
thing he  does  not  really  understand.  Experience  has 

~-n  that  the  majority  of  the  average  users  always 
operate  with  the  default  integration  method  imple- 
mented in  the  package  which,  in  most  cases,  is  a 
Runge-Kutta  algorithm  of  4th  order.  Since  he  does 
not  know  what  to  specify,  he  simply  ignores  that 
question,  and  after  some  time  of  using  the  software 
he  has  even  forgotten  that  the  language  provides 
him  with  the  facility  to  select  among  different 
integration  algorithms.  So  far,  no  integration 
algorithm  could  be  found  which  would  be  able  to 
handle  all  kinds  of  problems  equally  well,  and  it 
is  more  than  doubtful  whether  such  an  algorithm 
could  be  found  at  all.  The  user,  who  does  not  make 
use  of  the  facility  to  select  among  different 
integration  rules,  will,  consequently,  often  waste 
a  lot  of  computing  power.  Although  much  research 
has  been  devoted  to  the  development  of  different 
integration  methods  for  the  different  classes  of 
application  problems  [3,8],  the  user  has,  however, 
no  means  to  easily  determine,  from  the  state  space 
description,  the  problem  class  to  which  his  par- 
ticular application  belongs.  For  this  reason,  the 
selection  of  the  appropriate  integration  algorithm 
should  also  be  automated. 

For  this  purpose,  we  try  to  extract  features  from 
an  application  problem  during  its  execution  which 
are  supposed  to  characterize  the  numerical  be- 
haviour of  that  particular  problem  as  completely  as 
possible.  These  features  are  then  combined  in  a 
feature  space  in  which  we  can  identify  specific 
clusters  for  which  a  particular  integration  method 
is  optimally  suited.  The  proposed  methodology  for 
the  solution  to  this  problem  originates  from  pat- 
tern recognition. 

What  features  may  be  used  for  this  purpose?  A  first 
feature  can  be  associated  with  the  accuracy  re- 
quirements for  the  problem.  It  can  be  found  that 
the  CPU-cost  to  execute  a  particular  problem 
depends  on  the  required  relative  accuracy.  Low 
order  algorithms  are  appropriate  for  the  treatment 
of  systems  from  the  "gray-"  and  "black  box"  area 
where  the  available  data  and  models  are  so  vague 
that  a  precise  numerical  integration  does  not  make 
much  sense,  whereas  higher  order  algorithms  are  ap- 
propriate for  the  handling  of  systems  form  the 
"white  box"  area,  e.g.  from  celestrial  mechanics. 
Since  the  user  is  requested  to  specify  the  wanted 
relative  accuracy,  this  feature  can  be  extracted 
from  the  input  data. 

Multi-step  methods  require  more  CPU-time  during 
their  initial  phase,  but  are  more  economic  than 
one-step  methods  if  integration  goes  on  over  a 
longer  interval  of  simulated  time.  This  can  be  ex- 
plained by  the  fact  that  one-step  methods  are 
self-starting  whereas  multi-step  methods  need  to  be 
initialized.  This  leads  to  a  second  feature.  Since 
the  integration  has  to  be  restarted  after  "event 
times"  (instants  at  which  some  state  trajectories 
are  discontinuous),   multi-step   integration  is  in 


favour  for  purely  continuous  problems  or  for 
problems  with  few  event  times,  where. is  one-step 
integration  is  appropriate  Tor  combined  (con- 
tinuous/discrete) problems  where  discrete  events 
occur  with  a  high  density. 

Each  integration  algorithm  has  associated  with  it  a 
domain  of  numerical  stability.  The  stiffer  a  parti- 
cular problem  is  (the  more  the  different  eigen- 
values (A.)  of  the  Jacobian  are  separated),  the 
smaller  the  step-size  (h)  must  be  in  order  to  keep 
all  (X.*h)  within  the  stability  region  of  the  algo- 
rithm. Fortunately,  special  integration  algorithms 
could  be  found  for  which  this  restriction  no  longer 
holds  (A-stability,  stiff  stability).  If  such  an 
algorithm  is  used,  the  step-size  need  not  be  re- 
duced due  to  restrictions  imposed  by  stability  de- 
mands, but  is  determined  exclusively  by  the  re- 
quirements of  accuracy.  For  this  reason  the  eigen- 
value distribution  of  the  Jacobian  determines  a 
third  feature  which  must  influence  our  decision  as 
to  which  integration  algorithm  to  use  for  the 
execution  of  a  particular  problem. 

/ 
These   three  features  can  now  be  combined  to  a  fea- 
ture space  as  depicted  in  fig.l. 


i t  Accuracy 


Stiffness 


Smoothness 


Fig.l:  Feature  space  for  selection 
of  integration  algorithm 

18  clusters  have  been  distinguished  in  fig.l,  and 
fig. 2a  and  2b  show  integration  rules  which  can  be 
associated  with  them. 
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Fig. 2:  Integration  algorithms  to  be 
used  for  smooth  (left)  and 
non-smooth  (right)  problems 
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As  can  be  seen  from  fig. 2,  some  of  the  assignments 
are  still  open. 

Concerning  (a) :  A  Gear  algorithm  [3]  would  be  ap- 
propriate, but  it  should  be  of  about  8th  order  (at 
least  for  a  CDC  6000  series  installation  —  this 
depends  on  the  length  of  the  mantissa),  whereas,  in 
the  Kahaner  implementation  [6]  we  use,  we  have  only 
up  to  5th  order  available. 

Concerning  (b) :  For  these  clusters,  a  one-step 
algorithm  with  a  stiff  stability  behaviour  would  be 
most  suitable.  So  far,  we  have  experimented  with 
IMPEX-2  [9],  and  with  DIRK  [1],  but  the  programming 
style  of  these  algorithms,  as  they  are  at  our  dis- 
posal at  the  moment,  is  not  sufficiently  elaborate 
to  allow  for  a  fair  comparison  with  the  extremely 
careful  and  sophisticated  Kahaner  implementation  of 
the  Gear  algorithm. 


Jacobian  out  of  the  state  space  description  of  the 
problem.  This  can  either  be  numerically  approxi- 
mated at  run-time,  or  one  can  compute  it 
algebraically  by  means  of  formulae  manipulation  at 
compile-time.  This  is  rarely  done  by  available 
simulation  compilers  but  it  is  feasible,  and  seems 
to  be  a  promising  approach.  The  wanted  features  can 
now  be  computed  by  estimating  the  critical  eigen- 
values. The  eigenvalues  with  the  largest  and  smal- 
lest absolute  values  can  be  approximated  by  appro- 
priate matrix  norms,  whereas  the  real  part  of  domi- 
nant poles  can  be  found  by  estimating  the  "margin 
of  stability"  [5,7].  However,  since  several  quan- 
tities are  needed,  we  found  that  it  is  in  most 
cases  faster  to  compute  the  whole  set  of  eigen- 
values directly  by  use  of  the  EISPACK  software  [2]. 
The  required  CPU-time  turned  usually  out  to  be  neg- 
lectable  compared  to  the  time  spent  for  numerical 
integration. 


Concerning  (c) :  For  this  cluster,  a  high-order 
stiffly-stable  one-step  method  would  be  most  appro- 
priate. Such  an  algorithm  is,  however,  unknown  to 
date. 


Ill)  ADAPTIVE  SELECTION  OF  INTEGRATION 
ALGORITHMS : 


As  a  matter  of  fact,  the  feature  space  as  depicted 
in  fig.l  is  still  a  simplification.  To  show  this, 
let  us  consider  a  system  with  complex  dominant 
poles  close  to  the  imaginary  axis.  For  the  treat- 
ment of  such  a  system,  a  stiffly-stable  method 
cannot  be  properly  applied.  The  system  has, 
however,  fast  transients,  making  a  Runge-Kutta 
algorithm  not  suitable  either.  Thus,  these  types  of 
systems,  which  are  called  "highly  oscillatory" 
systems,  will  again  require  special  methods  (like 
stroboscopic  methods)  for  efficient  handling 
(C.W.Gear:  private  communication).  This  establishes 
a  fourth  feature  which  is  to  be  used  for  the  deter- 
mination of  the  integration  method.  The  reason  for 
the  primary  simplification  lies  in  the  fact  that  a 
3-dim.  feature  space  can  be  graphed  easier  than  a 
4-dim.  one  ( ! ) . 


In  a  nonlinear  case,  the  Jacobian  will  usually  be 
time  dependent,  and,  with  it,  also  its  eigenvalues. 
Since  the  classification  is,  in  general,  defined 
for  linear  systems,  it  may  well  be  that  in  a  non- 
linear case  it  would  be  best  to  assign  the  integra- 
tion algorithm  dynamically  to  the  problem.  For  this 
purpose,  one  has  to  recompute  the  eigenvalues  from 
time  to  time  to  find  out  whether  the  integration 
method  in  use  is  still  appropriate.  It  seems  a  good 
idea  to  recompute  the  eigenvalues  as  soon  as  the 
step-size,  which  is  controlled  by  the  integration 
rule,  has  changed  by  an  order  of  magnitude,  but  not 
before  a  minimum  time  span  of  maybe  0.01  times  the 
run  length  has  elapsed.  This  can  then  be  used  to 
obtain  an  adaptive  selection  of  the  appropriate 
integration  scheme. 


The  information  provided  by  these  four  features  is 
sufficient  to  determine  the  best  suited  integration 
algorithm  for  most  application  problems. 


IV)  VERIFICATION  OF  SIMULATION  WITH 
RESPECT  TO  MODELING: 


There  are  even  two  more  features  which  can  be  ex- 
tracted from  the  eigenvalue  distribution  of  the 
Jacobian.  These  are  used  for  other  purposes,  and 
will  be  presented  in  due  course. 

So  far  we  have  defined  features,  and  we  have  as- 
sociated integration  methods  with  them.  It  remains 
to  determine  how  these  features  can  be  extracted 
from  the  state  space  description  of  the  problem. 
The  first  feature  (relative  accuracy)  is  user 
specified  on  data  input.  The  second  feaLure 
(smoothness)  could  also  easily  be  user  provided.  It 
is,  however,  a  simple  task  to  detect  automatically 
whether  a  problem  is  continuous  or  combined.  If  the 
problem  turns  out  to  be  combined,  one  can  count  the 
number  of  event  times  during  a  certain  period  of 
simulated  time,  and  decide  then  whether  it  is  a 
smooth  or  a  non-smooth  combined  problem.  Concerning 
r th  features  (stiffness  /  highly 
iy  behaviour)   one  has   to  compute   Che 


Let  us  assume  that  a  valid  model  has  been  derived 
from  the  physical  system  under  investigation,  and 
let  us  question  what  assurance  we  have  that  the 
time  responses  which  we  obtain  through  simulation 
represent  the  (valid)  model  correctly. 

For  a  variable-step  integration  method  being  used, 
we  normally  trust  in  the  step-size  control  mecha- 
nism which  is  equivalent  to  confiding  in  the  error 
estimation  procedure.  This  will  usually  be  justi- 
fied as  long  as  the  local  error  which  we  control 
can  be  used  as  a  valid  estimate  for  the  global 
error  in  which  we  are  interested. 

Experience  has  shown  that  local  errors  will  not 
usually  accumulate  as  long  as  the  system  is 
numerically  stable.  In  a  nonlinear  case,  it  may, 
however,  happen  that  some  eigenvalues  "walk"  into 
the  right  half-plane  for  a  short  period  of 
simulated  time.  Let  us  consider,  as  an  example,  the 
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well  known  Van-der-Pol  equation.   Fi£.  i   shows   how 
^envalues   "walk  around"  during  one   limit 


Fig. 3:  Eigenvalue  movement  of  the 
Van-der-Pol  equation  during 
one  limit  cycle 


Eot  given  experimental  cond i t i mis ,  properly  repre- 
sents the  system  under  investigation.  There  exist 
several  possible  means  to  help  the  user  in  this 
spect  (like  asking  him  to  supply  dimensions  lor  .ill 
variables  in  the  system  to  enable  an  automated  di- 
mensional analysis).  In  this  section  we  want  to 
show  that  the  eigenvalue  distribution  can  also  help 
to  answer  this  question  to  some  extent. 

It  has  been  shown  in  [A]  that  only  those  eigen- 
values of  a  matrix  can  be  properly  computed  which 
fulfil  the  following  inequality: 


>  a   *e 


(i/n) 


where  a        is   the   largest   singular  value  of  the 
matrix,  £  is  the  machine  resolution  (e.g.  MO  '  "*  on 


a  CDC  6000  series  installation),  and  n  is  equal  to 
the  order  of  the  model.  For  higher  order  models 
e"n  approaches  1.0  and  hardly  any  eigenvalues  will 
then  be  properly  computable.  Smaller  eigenvalues 
can  take  any  value  and  small  modifications  of  the 
elements  of  the  matrix  can  place  them  almost  any- 
where within  the  band  of  incertainty. 


As  can  be  seen,  the  system  becomes  periodically  un- 
stable. During  such  time  intervals,  errors  will  ac- 
cumulate, and,  consequently,  we  must  be  careful  in 
the  interpretation  of  the  obtained  results.  This 
fact  should  be  reported  by  the  software  to  the 
user. 

For  this  purpose,  we  define  another  feature 
(stability).  A  variable  STAB  is  set  equal  to  zero 
when  all  eigenvalues  lie  in  the  left  half-plane  and 
is  equal  to  one  as  soon  as  at  least  one  of  the 
eigenvalues  moves  into  the  right  half-plane. 


STAB  -  { 


0.0  :  Re{X.}  <  0.0  ;  i=l, 


1.0  :  otherwise 


We  now  collect  statistics  on  STAB  as  for 
time-persistent  variables.  In  this  way  we  obtain 
the  integral  of  STAB  over  time  divided  by  the  run 
length: 


1.0 


FF5 


"fin 


beg1 


J 


fin 
(STAB)dt 


beg 


The  fifth  feature  (FF5)  is  a  real  number  between 
0.0  and  1.0.  If  it  is  close  to  0.0,  the  results  ob- 
tained by  simulation  have  a  good  chance  to  be  re- 
liable. If  it  is  close  to  1.0,  the  obtained  results 
are  most  probably  nonsense,  and  they  must  be  cau- 
tiously verified. 


If  we  now  assume  that  the  matrix  under  investiga- 
tion is  a  Jacobian  of  a  state  space  description  for 
a  real  physical  process,  then  the  elements  of  the 
Jacobian  are  extracted  from  measurements,  and 
cannot  be  computed  more  accurately  than  e,  which  is 
a  relative  accuracy  of  measurement.  We,  therefore, 
must  assume  that,  within  that  relative  accuracy  e, 
the  elements  of  the  matrix  can  take  any  values.  In 
this  case,  we  must  also  assume  that  eigenvalues  of 
the  Jacobian  which  do  not  fulfil  the  more  stringent 
inequality: 


>  a  *e 


0/n) 


can  take  any  value  within  that  broader  band, 
although  they  can  be  much  more  accurately  computed 
as  soon  as  any  particular  values  have  been  assigned 
to  all  elements  of  the  Jacobian.  This  means  that  as 
soon  as  there  exist  eigenvalues  for  which  the 
second  (more  stringent)  inequality  does  not  hold, 
small  variations  in  the  systems  parameters  which 
lie  within  the  inaccuracy  of  the  measurement  can 
make  the  model  non-stiff  or  stiff  or  even  unstable. 
Physically  seen,  these  eigenvalues  correspond  to 
merely  constant  modes  which  could  as  well  be 
eliminated  from  the  equation  set  resulting  in  a 
model  reduction.  Numerically  seen,  these  eigen- 
values can  lead  to  accumulation  of  errors  so  that 
these  modes  can  drift  away  over  a  longer  span  of 
simulated  time,  again  resulting  in  incorrect 
simulation  trajectories. 

Together  with  the  eigenvalues,  we  compute  the  fol- 
lowing quantity: 


V)  VALIDATION  OF  THE  MODEL  WITH  RESPECT 
TO  THE  SYSTEM  UNDER  INVESTIGATION: 


Another  non-trivial  question  is  whether  a  model, 


BORD 


and  the  number  k  indicating  those  eigenvalues  whose 
absolute  value  is  smaller  than  BORD: 
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n  0 :  |  A  .  |  >_  BORD 

k  =  2C(j.)  5  j.  =  { 

i=l  1:  I  A. I  <  BORD 


k  represents  an  integer  between  0  and  n. 

We  now  collect  statistics  on  the  quantity  (k/n)  as 
for  time-persistent  variables,  and  obtain  a  sixth 
feature: 


FF6 


1.0 
"fin   "beg 


"fin 


*  f(k/n)dt 

t,-.   -  t,      -* 


beg 


Also  the  sixth  feature  (FF6)  is  a  real  number  bet- 
ween 0.0  and  1.0.  If  it  is  close  to  0.0,  the  model 
has  some  chance  to  be  valid.  If  it  is  close  to  1.0, 
the  model  is  most  probably  invalid,  and  it  should 
be  further  investigated. 

Evaluation  of  features  FF5  and  FF6  requires  com- 
putation of  the  eigenvalues  of  the  Jacobian  once 
per  integration  step.  Since  this  can  be  expensive, 
it  should  not  be  done  automatically,  but  the  user 
must  have  a  switch  at  his  disposal  to  turn  computa- 
tion on  and  off.  In  this  way  he  can  use  these  fea- 
tures during  the  development  of  a  new  model, 
whereas  he  can  turn  computation  off  during  produc- 
tion runs. 


VI)  DETERMINATION  OF  CRITICAL  STATES: 


In  section  V  we  have  discussed  the  case  where 
single  eigenvalues  were  situated  close  to  the  ima- 
ginary axis,  and  we  have  seen  that  in  such  a  case 
it  may  be  possible  to  reduce  the  order  of  the 
model . 

It  is,  however,  as  interesting  to  discuss  the 
oposite  case  where  single  eigenvalues  are  located 
in  the  A-plane  far  to  the  left.  We  call  these  modes 
the  "critical  states"  of  the  system.  Very  often  one 
is  not  really  interested  in  these  fast  transients. 
In  such  a  case  one  could  eliminate  these  modes  from 
the  equation  set.  If  the  fast  transients  are  impor- 
tant one  could  at  least  try  to  utilize  special 
integration  techniques  (like  using  singular  pertur- 
bations) to  expedite  integration. 

One  can,  of  course,  again  compute  the  eigenvalue 
distribution  for  the  solution  of  this  problem. 
However,  it  is  not  always  easy  to  see  which  state 
equations  are  responsible  for  such  an  eigenvalue. 
For  this  reason  we  recommend  the  following  pro- 
cedure. 

We  reserve  an  integer  array  i   of  length  n  which  is 

initialized  to  zero.  Each  time  an  integration  step 

has  to  be  rejected  due  to  accuracy  requirements  not 

met,   we  add   1   to  each  element  of   the 

i  <k)    for  which  the  accuracy  is  not  met.  This 
n 
implies,  of  course,  that  the  local  truncation  error 

is  estimated  Cot  all  state  variables  independently. 


At  the  end  of  the  simulation  run  we  divide  each 
element  of  the  array  by  the  total  number  of  re- 
jected integration  steps  and  obtain  in  this  way  an- 
other set  of  n  real  numbers  between  0.0  and  1.0. 
Elements  with  the  largest  value  indicate  critical 
states. 
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THE  A-STABILITY  OF  EXPONENTIALLY  FITTED  LINEAR  MULTI-STEP 
METHODS  EXACT  ON  PERIODIC  FUNCTIONS 


Tsvi  Bar-David  (White) 
Lockheed  Missiles  &  Space  Company,  Inc. 


ABSTRACT 

A  linear  multi-step  (LMS)  method  for  solving 
ordinary  differential  equations  (ODE's) 


dv_ 
dt 


f(y.t)  ,  y(t  )  =  yr 


f('.t)  :    R" 


produces  a  numerical  solution  u  (h)   y(t  )  via 

n    —    n 

the  recursion 


k 

I 

j-o 


a  u 


k 
h  Z 


j-0 


J  n+j 


n+j 


f(un+j'  W'  Vj  =  Co  +  h(n+j) 


If  a,  ^  0  it  is  called  a  k-step  method.  A  k-step 

method  is  said  to  integrate  exactly  a  set  F  of 

scalar  functions  if,  whenever  it  is  applied  to  an 

ODE  whose  solution  is  a  linear  combination  of 

functions  in  F,  it  produces  a  numerical  solution 

equal  to  the  ODE  solution  at  the  mesh  points 

t  =  t  +  nh.   A  k-step  method  is  called 
no 

exponentially  fitted  to  (the  complex  number) 

X  of  the  order  p  if  it  integrates  exactly 

P(t)e   for  all  polynomials  P(t)  of  degree  <_  p  . 

A  k-step  method  may  be  fitted  to  a  set 
of  distinct  X's  of  different  orders.  A 
standard  k-step  method  of  order  p  is  nothing 
more  than  an  exponentially-fitted  method 
fitted  to  (and  only  to)  X  ■  0  of  order  p  . 


each  fixed  stepsize  h  >  0  is  bounded  for  all  n  . 
We  prove  that  if  a  k-step  method  is  exponentially 
fitted  to  +iw  (w  real)  of  order  p  >^  0  and  is 
A-stable,  then  the  method  is  implicit  and 
p  <_  2    .   This  extends  the  well-known  result  of 
Dahlquist  (1963)  which  states  that  if  a  standard 
k-step  method  of  order  p  is  A-stable,  then  it  is 
implicit  and  p  _<  2  . 

We  show  that  if  an  exponentially-fitted 
method  is  exact  on  (and  only  on)  the  functions 


1,  e 


V 


X  t 

,  P 


(the  X.  need  not  be  distinct)  then  its  local 
truncation  error  x  satisfies 


hi  =  c  hP+1  Try  +  0(hP+2) 
n 


Try(t)  =  D(D  -  Xx) 


(D  -  X  )y(t)  , 


We  then  prove  that  the  usual  Milne  device  can  be 
used  to  estimate  the  local  truncation  error  of 
an  exponentially-fitted  predictor-corrector  pair 
whenever  both  the  predictor  and  the  corrector  are 
exact  on  the  same  set  of  functions. 


A  k-step  method  is  called  A-stable  if,  when- 
ever it  is  applied  to  an  ODE  of  the  form 

y  =  Xy  with  ReX   <  0  , 

it  produces  a  numerical  solution  u  (h)  which  for 

n 
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where  f  (t , y (t )  )  i 
function  on  0<t<T 


is  a  real  continuous 
on  0<t<T,  -m<y<m,     satisfyi: 

f  (t,: 


,  n. 

re  the  y  ,  1=0, 1 , - , mk-1  are  given 
initial  values  (usually  only  specified 


for  i= o ,m , - , m (k-1 ) )  and  V  is  a  positive 

integer  such  that  V  S     ( i-mk  +  1 ) modm .    The 

coefficients  are  real  and  we  take 

(V) 
a  ,   =1  for  all  v  =  1,2, -,m. 
mk 

In  matrix  notation  this  yields 

g  (2) 


v 


V 


hB.  f 

h 


where 


and 


(fo'fi'"'V 


<yo'?l'-'ymk-l'°'-'0) 


The  matrix  AL  has  the  structure 

n 


(3) 


(N+l) X  (N  +  l) 


where 


and  B   has  a  similar  structure,  and  the 
order  of  E  is  independent  of  N. 

Dahlquist  stability  can  then  be 
interpreted  as  requiring  that3M» 
independent  of  h  (and  N),  such  that 

IIa"1!!.  <  M/h 


(4) 


and  absolute  stability  by  requiring  that 
"^  M',  independent  of  N,  such  that 

I  I  <Ah  -  h  Bh)_1|  \m    <     M-      (5) 

where  h  =  hX  and  S,     has  been  applied  to 

x  n 

the  usual  test  equation  y'  =Xy. 
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It  is  important  to  have  a  more 
pragmatic  characterisation  of  (A)     and  (5) 
which  is  usually  presented  in  the  form  of 
a  root  condition.    This  paper  is  concerned 
with  many  different  possible  character- 
isa t  ions . 

The  formulation  presented  here  (even 
though  it  is  only  an  example)   s  quite 
general  and  includes  predictor-corrector 
(hybrid  or  otherwise)  in  any  mode, 
extrapolation  (but  not  the  G.B.S.  method), 
Runge-Kutta,  cyclic  methods,  and  with 
a  small  modification  advanced  linear 
multistep  methods. 


Of  course  in  general  A 


and  Bw  will 
n 


not  be  band  lower  triangular  matrices 
but  will  be  simply  lower  triangular 
matrices ,  e.g. 


\  - 


<>Nt|)>i(,K+0 


For  example  this  arises  when  product 
integration  techniques  are  applied  to 
solve  the  Abel's  equation  (Holyhead  and 
McKee) .    Thus  this  paper  includes  a 
study  of  A   in  this  more  general 

formulation . 

Dahlquist  stability  and  absolute 
stability,  as  defined  by  (4)  and  (5), 
can  be  characterised  by  requiring  that 
the  eigenvalues  of  a  product  of  Frobenius 
matrices  all  lie  inside  or  on  the  unit 
circle,  those  on  having  linear 
elementary  divisors.    They  can  be 
characterised  by  the  approach  of 
Donelson  and  Hansen  (1971),  and  by  the 
generating  function  approach  of  Holyhead 
and  McKee  (1976).    This  paper  demonstrates 
the  precise  equivalence  of  these  and 
other  different  characterisations. 

These  results  have  evolved  from  and 
in  parallel  with  earlier  works  by  the 
author  and  his  co-workers.    A  list  of 
the  more  pertinent  references  can  be  found 
in  the  bibliography. 
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Some  Experiments  with  STRIDE 


Fred  Chipman 
Acadia  University 


0.  Introduction   STRIDE  is  an  integrator 
for  systems  ot  ordinary  differential  equa- 
tions, designed  by  Kevin  Burrage,  J.C. 
Butcher  and  Fred  Chipman.   In  section  1,  a 
brief  description  of  the  method  will  be 
presented,  followed  by  a  few  details  of 
the  algorithm.   Section  2  presents  prelim- 
inary results  of  experiments   with  STRIDE 
on  two  problems.   These  results  seem 
sufficiently  promising  to  justify  further 
work  on  this  type  of  implicit  Runge-Kutta 
method. 

1.  Description  of  the  method   The  algor- 
ithm implemented  in  STRIDE  is  based  on  the 
singly- imp lici t  methods  introduced  in 
Burrage  [3]  and  Butcher  [5].   These  meth- 
ods are  of  the  implicit  Runge-Kutta  type 
but  with  coefficient  matrix  so  chosen  as 
to  have  only  a  single  distinct  eigenvalue. 
When  the  method  of  implementation  des- 
cribed in  Butcher  [4]  is  used,  methods 
with  coefficient  matrices  of  this  type 
require  a  similar  number  of  arithmetic 
operations  for  the  performance  of  its  LU 
factorization  as  do  linear  multistep 
methods . 

In  the  usual  notation  for  Runge- 
Kutta  methods,  the  stages  are  given  by 

(1.1)   Yi=yn.1+h  I    ai.f(xn_1+hcj,Yj) 
j  =  1 

(i=l,2,...,s) 

and  the  result  at  the  end  of  a  step  by 
s 

Ci. 2)     yn=yn-i+h.I  bjf(xn-i+hcj>Yj)  • 

Let  £i,...£s  represent  distinct 
zeros  of  Ls(z),  the  degree  s  Laguerre 
polynomial".   Then,  with  A  any  positive 
number,  the  coefficients  for  the  particu- 
lar methods  we  are  considering  are  gi^cn 

Ci=A£i,  i=l...s, 

ind  bT=f  ]  ,...]/,  )\  L, 
whci  i  .J/  j  J  . 

latri)  i  ned , 


has  characteristic  polynomial  (z-A)s,  and, 
as  shown  in  Butcher  [5 ] ,  can  be  written 


AT  =  A 


1  . 

0 

1 

0    - 

•-1 

•  1 

where,  for  T=(tij)  and  T"1=(tT1), 
t^-LjUi)  and  ^--Itll^llp 


If 


'i=Yi-yn-i.  h  =f(xn.1+hci,Yi), 


(i=l,2,...s)  and  Z-,  F.  denote  the 

corresponding  transformed  quantities  given 
by 

CI. 3)  Zi=  I      tTjz.  ,  Fi-  I      t^F,, 


j-l 


ij  J 


j  =  l 


ij  J 


and  if  H=hA,  then  (1.1)  is  equivalent  to 
the  system 


(1.4) 


HF\ 


Zi  =  HFi-HFi_1(i=2,...s) 


and  (1.2)  is  equivalent  to 

s  1 
(1.5)   yn=yn-re  I   ±LJ(e)HF.,  where  e-i/x. 

i  =  l 

In  the  special  case  that  6=Cj  for  some  i, 
and  hence  c^=l,  the  computed  result  yn  is 
equal  to  Yj_,  and  the  method  is  of  order  s. 

This  choice  is  the  one  made  in  STRIDE 
and,  in  consequence,  the  stability  proper- 
ties of  the  methods  are  acceptable  for 
many  stiff  problems.   Since  the  expansion 
of  the  solution  given  by  (1.5)  is  equally 
valid  for  other  choices  of  6,  this  formula 
serves  as  a  basis  for  interpolation  of 
output  values  and  for  the  provision  of 
starting  values  for  the  iterative  evalua- 
tion of  Y.,Y-,...Y  .   For  stiff  problems, 

(1.4)  is  solved  using  a  modified  Newton 
iteration.   This  requires  the  LU  decom- 
position of  the  matrix  (1-11. J),  where  J 
I  notes  the  .lacobian  matrix  for  the 
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.rential  equation.   As  in  most  algor- 
ithms, J  is  not  computed  in  Bach  iteration, 
but  only  when: 

convergence  rate  is  unacceptable, 

(b)   H  has  changed  by  a  factor  greater 
than  2  or  less  than  .S  since  the 
last  evaluation, 

20  steps  have  ellapsed  since  the 
last  evaluation. 

This  iterative  scheme  is  modified  in 
STRIDE  bv  the  use  of  a  relaxation  factor 
I+H  IT),  where  H'  is  the  value  of  H 
at  which  (I-HJ)  was  last  evaluated  and 

:ored.   The  rationale  behind  this  mod- 
ification is  that,  if  u  is  an  eigenvalue 
of  J  then  the  corresponding  eigenvalue  of 
an  iterative  scheme  based  on  R(I-HJ)"1  is 
1- R( 1-Hu)/ (1-H1 u) ,  and  the  supremum  of 
this  for  all  u  in  the  negative  half-plane 
is  minimized  when  R  is  given  the  value 
we  use.   This  minimum  value  is  in  fact 
equal  to  | (H-H ' ) / (H+rT  ) | < 1  so  that,  for 
slowly  varying  Jacobians,  eventual  con- 
vergence would  be  achieved  no  matter 
how  much  H  varies. 

For  non-stiff  problems  (1.4)  is 
solved  using  a  fixed  point  iteration, 
applied  either  a  fixed  number  of  times  or 
iterated  to  convergence. 

Error  estimates  are  provided  by  em- 
bedding the  s-stage  (order  s)  method  in 
an  s+1  stage  (order  s  +  1)  method.   As  a 
bonus,  error  estimates  are  also  provided 
for  methods  with  orders  from  1  up  to  s 
(and,  if  H  has  remained  unchanged  for  two 
steps,  up  to  s+1) .   Thus  we  are  able  to 
implement  the  methods  in  a  variable  order, 
variable  step  size  manner. 


A  complete  description  of  the 
gorithm  STRIDE  is  given  in  [1],  wh 
the  theoretical  aspects  are  presen 
[2].  Only  a  few  basic  parameter  d 
tions  will  be  given  here.  The  pro 
STRIDE  has  only  the  four  arguments 
FN,  JAC  and  VALUES.  N  is  the  dime 
of  the  problem,  FN  and  JAC  are  pro 
for  the  evaluation  of  f(x,y)  and  J 
and  VALUES  is  a  user  defined  proce 
by  way  of  which  all  information  is 
changed  between  the  method  and  the 
Amongst  the  parameters  of  VALUES  i 
the  integer  variable  METHOD.  The 
digits  nin2n3  of  METHOD  indicate  t 
STRIDE  the  following: 


n,=l 


=  0  - 


error  per  step  is  to  be  con 
error  per  unit  step  is  to  b 
trolled 

Newton  iteration  with  an  in 
approximated  Jacobian 
Newton  iteration  with  a  use 
supplied  Jacobian 
Fixed  point  iteration,  iter 
convergence 

fixed  point  iteration  appli 
fixed  number  of  times, 
n,  indicates,  for  n2*0,  the  maximum 


n2  =  3 

=  2 
=  1 
=  0 


al- 

ile 

ted  in 

escrip- 

cedure 
N, 

nsion 

cedures 

(x,y) , 

dure 
ex- 
user. 

s 

three 

o 

trolled 
e  con- 

ternally 

r 

ated  to 

ed  a 

number 


of  iterations  permitted  whereas,  for  n 

it  indicates  the  number  of  iterations  to 
take  place. 

Finally,  in  incorporating  the  Lncre 
ments  to  both  X  and  Y,  the  effect  of  round 
off  error  is  reduced  by  computing  resi- 
duals equal  to  the  intended  increim 
minus  the  actual  changes  in  the  quantities 
being  updated.   These  residuals  are  added 
to  the  increment  for  incorporation .into 
the  following  step.   Hence,  if  <     i   the 
increment  in  Y,  we  perform  the  following 
steps  in  updating  Y: 

RES:=RES+AY; 
TEMP:*Y;Y:=Y+RES; 
RES:  =  RES-  (Y-TEMP)  ; 

2.   The  Experiments   In  this  section  we 
describe  several  experiments  performed 
with  STRDIE  using  the  following  problems. 

Problem  1:   The  equations  of  motion  for  a 
satellite  moving  under  the  influence  of 
the  earth  and  the  moon  in  a  coordinate 
system  rotating  so  as  to  keep  the  posi- 
tions of  the  earth  and  the  moon  fixed  [6]. 

y"  =  2y2+yi-u' (yl+u)/r?-u(yi-u')/rl 

y"  =  -2y'i+y2-u'y2/r^-uy2/rl 

yi(0)  =  1.2,  y!(0)=y2(0)=0, 

y2(0)=-l. 0493575098. . .   where 

r.  =  [(yi  +  u)2  +  yi]\  r2=[(yi-u<)2+y22}h 
u  =  1/82.45   and  u'  =  1-u. 

These  initial  conditions  produce  a  closed 
orbit  with  period  T=6 . 19 2169 3313 .. . 

The  solution  is  required  over  one  period. 

Problem  2:   A  Chemical  kinetics  problem 
[7,  pp  268-269] 

yi  =  -  .04y1  +  10'*y2y3 

y2  =  .04y1-10'*y2y3-3-107yi 

y3  =  3-107y! 

with  y,(0)=l,  y2(0)  =  y3(0)  =  0. 

Results  are  required  at  t=4#10  , 
k=0,l,...,9. 

The  first  problem  is  non-stiff,  and 
is  a  good  test  of  the  stepsize  control. 
The  second  problem  is  stiff,  and  is  fre- 
quently used  as  an  illustrative  example. 

Experiment  1:   To  investigate  the  relative 
efficiencies  of  the  non-stiff  options, 
problem  1  was  solved  using  METHOD  =  101, 
102,  112,  113  with  EPS=10"5,  10-8.   The 
number  of  function  evaluations  required 
were 
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IODj         LOJ    112.   113  D\  DG 

;   1070   L882   1882   1882    846 

10"7   1687   3165   3548   3193   1288 

These  results  indicate  that  for  this 
problem  there  is  little  use  in  doing  more 
than  one  fixed  point  iteration.   This  has 
proved  to  he  the  case  for  all  other  non- 
stiff  problems  that  have  been  tested,  and 
thus  METHOD=101  would  seem  to  be  a  natural 
choice  for  such  problems.   It  should  be 
noted  however  that  this  method  required 
considerably  more  evaluations  than  Krough's 
DVDQ. 

Experiment  2:   To  test  the  effectiveness 
of  the  accumulation  of  residuals  to  mini- 
mize round  off  error,  problem  2  was 
solved  with  METHOD  101,  and  a  realtive 
error  tollerance  of  10"8.   The  results,  at 
the  end  of  one  period  were: 

vi       Y2        >'3         yif 
R  1.200000"  3.08(-6)  -2.58(-6)  -1.0493583 

R  1.2000012  4.48(-6)  -2.60(-6)  -1.0493587 

The  results  show  a  slight  improvement  with 
the  acculumlation  of  residuals  (R) . 

Experiment  5:   To  compare  the  effects  of 
restricting  the  Newton  iteration  to  a  max- 
imum of  2,  3,  or  4  iterations,  problem  2 
was  solved  with  METHOD=122,  123,  124  and 
EPS=10~6.   The  following  are  the  number  of 
function  and  Jacobian  evaluations  at 
t=4xl09: 


METH0D= 


122 


#FN 
UAC 


12  59 
50 


125 

1161 

38 


124 
1199 


EPISODE 
620 
105 


a  maximum  of 
Included  here 


For  this  particular  problem, 

three  iterations  seems  best. 

are  results  using  Hindmarsh's  EPISODE, 

which  required  far  fewer  function  evalua 

tions,  but  considerably  more  Jacobian 

evaluat i  on . 


Experiment    4:      The    J 
whenever   any   one    of 
described    in    section 
was    found   by    monitor 
solving   problem   2    th 
convergence    rate    ace 
of    the    evaluations, 
large    change    in   H   ac 
remainder.       Letting 
value    of    H    for   which 
tluated,    (I-HJ)    is 
H ' /H    no    longe  r    ]  i 
In    thi  n t  , 

■      HOD=133   and 
i gned    the 
11/2,2)    and    [1/3,3]  , 
■Jacobian   was    used 
function    < 

of    evaluations    of    bo 
t j -4  .  10 

i        [3/4,4/3]        [2/3,3 


acobian    is    evaluated 
the    three    conditions 
lis    satisfies .       I t 
ing    the    program   while 
at    an    unsatisfactory 
ounted    for   about    1/3 
and    a    sufficiently 
counted    for    the 
H'     represent    that 
(I-H.JJ    was    last 
re-computed  when 
i  n    the    interval     [ h , 2] 
problem    2    was    solved 
EPS=10"6.      I    was 
[3/4,4/3],     [2/3,3/2], 
and    a    numer i  en 1 
that    the    number    of 
I        the    number 
th    f    and    J.       Results 
',     i-0,|,...i. 

/2|   [J/2,2]  |1/3,3| 


1 

3  36 

2 

49  0 

3 

672 

4 

820 

5 

963 

6 

1059 

7 

1135 

8 

1196 

9 

1229 

298 

525 

701 

837 

968 

1063 

1170 

1225 

1287 


375 

550 

738 

934 

1056 

1159 

1273 

1324 

136  7 


3  39 
526 

968 
1168 
1310 
1391 
1456 
1548 


For  this  problem,  the  two  smaller  inter- 
vals seem  more  appropriate. 

Experiment  5:   To  determine  the  effective 
ness  of  the  relaxation  factor  R  in  the 
Newton  iteration,  problem  2  was  solved 
using  METHOD*] 23,  EPS=10"5,  10"8  and 
R=2/(l+H/H')  and  1  (no  relaxation  factor). 
The  results  are  given  at  t=4xl05.   The 
number  of  function  evaluations  (#FN), 
Jacobian  evaluations  (#JAC) ,  divergent 
iterations  (#DIVGD)  and  non- convergent 
iterations  (#  CONVGD)  were: 


EPS=10  J 
R=2/(l+H/H')   R=l 


EPS=10 
R=2/(l+H/H')   R=l 


#FN 

675 

832 

1975 

2243 

#JAC 

27 

33 

45 

55 

#DIVGD 

5 

3 

7 

12 

#  CONVGD 

3 

13 

12 

20 

For  both  values  of  EPS  substantially  more 
work  was  required  in  the  case  where  no 
relaxation  factor  was  used. 
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EXPONENTIAL  INTERPOLATIONS  AND  THE  NUMERICAL  SOLUTION  OF  O.D.E   (SUMMARY) 

ARIEH  ISERLES 
King's  College,  Cambridge. 


Exponential  approximations,  mainly  the  Pade 
approximations,  N-approximations  and  exponentially 
fitted  approximations,  have  attracted  great  interest 
during  recent  years  as  a  means  for  the  numerical 
solution  of  ordinary  differential  systems. 

The  common  feature  of  these  approximations  is 
that  they  fit  to  the  exponential  and  to  its 
derivatives  at  the  origin.   Hence,  they  are,  in 
fact,  Hermitian  interpolations  and  not 
approximations.   The  high  degree  of  interpolation 
at  the  origin  is  of  great  benefit  if  (in  the  case 
of  either  linear  or  mildly  nonlinear  autonomious 
differential  systems)  the  behaviour  of  the  solution 
is  determined  by  a  single  principal  eigenvalue  of 
the  Jacobian  matrix. 

From  the  computational  point  of  view,  this  is 
not  the  case  if  parabolic  or  hyperbolic  partial 
differential  systems  are  solved  by  the  method  of 
lines.   The  solution  of  the  derived  linear  ordinary 
differential  systems  is  determined  (unless  the 
solution  is  in  an  asymptotic  stage)  by  a  great 
number  of  eigenvalues,  in  the  neighbourhood  of  the 
origin.   Even  if  the  exact  spectrum  of  the  matrix 
is  unknown,  it  is  a  useful  practical  assumption 
that  the  eigenvalues  are  more  or  less  spread 
uniformly  in  an  interval    [-T,  0]   .   Out  of  these 
eigenvalues,  only  a  smaller  subset,  contained  in 
[-T0,  0]   ,   0  <  T0  «  T,  influence  strongly  the 
numerical  solution  of  the  system.   Hence,  it  is  of 
practical  interest  to  investigate  the  use  of 
exponential  approximations  which  fit  to  the 
exponential  along  an  equispaced  mesh  in   [-T  ,0]  , 
instead  of  high-degree  interpolation  at  the  origin. 

Such  exponential  interpolations  are  derived  in 
this  paper. 

The  first  result  extends  the  C-polynomial 
theory  of  N^rsett  to  the  interpolatory  case: 

A  1         k 
Theorem  1:   If   p(x,c)  =  £  — ,  p  (c)  x  ,  where 

k=o 

c  >  0,  p  (c)  -   1  and  p  (c) ,  p  ,(c)  are 

'  rn  ro  rn-l 

arbitrary  functions  of  c  and 


(1)   R(x,c,p):= 


Y,  (-l)kDn"kp(0,c)(l-e"C)k(-x/c). 

k=o         x ; 

£  (-l)kDn"kp(l,c)(eC-l)k(-x/c). 
k=0     x  k 


where  (-y).   is  the  factorial  function,  (-y)   =  1, 

rC  o 

(-y)k  =  (-y)(-y+l)  (y+k-1)  for  k  ^  1  and 

D   p(a,c)   is  the  (n-k)-th  derivative  of  p(x,c)  at 
(a,c),  with  respect  to  x,  then 


R(qc,c,p)  =  e 


■qc 


0  $  q  N<  n  . 


Definition :   The  function  p(x,c),  as  defined  in 
Theorem  1,  is  called  the  C-function  of  the 
exponential  interpolation  (1).   If  p(x,c)  is 
continuous  for  c  >  o  (or  an  interval  0  <  c  <  C)  and 
lim  p(x,c)   exists,  the  C-function  is  said  to  be 
c-»o+' 
smooth. 

Theorem  2:   If  p   is  a  smooth  C-function,  let 
p(x)  =  lim  p(x,c)   and  let  R(x,p)  be  the 

c-»  o+ 
exponential  approximation  which  is  generated  by  the 
C-polynomial   p(l-x).   Then,  if  for  all  suffici- 
ently small  c,  and  for  a  positive  integer  s, 

—  Q  C 
R(qc,cp)  =  e     ,   O^q^n+s,   then 

~ ,   ,».     -x    f]   ,  n+s+1) 
R(x,p)  -  e   =  U   (x 

Theorem  3:   Every  rational  function  R  (x)  = 

P  (x)/Q  (x) ,  that  satisfies  the  conditions 

deg  Pc  (   n,  deg  Qc 

0  i.   q  4   n,      can  be  expressed  in  the  form  (1),  and 


n,  and  R  (qc)  =  e-clc, 
<pressed  in  the  foi 
the  corresponding  C-function  is  unique.   Moreover, 
if  R   is  continuous  i 
C-function  is  smooth. 


if  R   is  continuous  in  the  parameter  c,  then  its 


This  theory  of  C-functions  makes  it  possible 
to  derive  explicit  formulae  for  several  families  of 
exponential  interpolations,  corresponding  to  the 
known  formulae  for  exponential  approximations:  two 
important  cases  are  given  in  the  next  two  theorems. 


Theorem  4:   If 


P    (x,c):: 

n,m 


Q    (x,c):: 
n,m 


Y,  (n+m-k):  m: 

,^  (n+m).'k:(m-k).' 
k=o 


"    (n+m-k)!  n! 

)-•   (n+m).'k:  (n-k): 
k=o 


(l-e"C)k(-^), 
c  k 


(l_ec)k(3), 
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and  R    (x,c):=  P    (x,c)/Q    (x,c)   then 
n  ,tn         n  ,m       n  ,m 

/   \     -qc 
R    (qc)  =  e  n  ,   0  .<  q  <  n+m  . 
n,m  N 

This  rational  approximation  is  analogous  to  Pade 

approximation,  because  all  the  parameters  of  R 

.  r   •       ,  ...      n.m 

are  used  to  satisfy  interpolation  conditions. 

Therefore  we  call  this  case  "Pade  interpolation". 
Theorem  5:   If  a  >  0, 

P    (x,c,a):=     Zh     (£   (-l)P(k)(l+acp)VpC)(--), 
n  ,       k.       *-"  p  c   k 

k=o  p=o 

and     R   (x,c,a):=   P   (x,c,a)    /    (l+ax)n 
n  n 

then      R    (qc.c.a)    =   e  ,      Ojcqxn 

n 

Further,  if  a   satisfies 
o 

»-,  ,  ,  ,k,n+L  .,    ,  .n  -kc 

J]   (-1)  (  .  )(l+a  ck)   e    =  0  , 

k=o 

then  R  ((n+1) c ,c ,a)  =  e        ("N- interpolations" ) . 
n  * 


A-acceptability :   As  yet,  no  general  conditions 
have  been  found  that  imply  A-acceptability. 
Various  computer  results  are  available,  concerning 
the  range  of  c  which  yields  A-acceptability  and 
A  -acceptability  for  various  Pade  and 
N- interpolations. 


Arieh  Iserles 
King's  College 
Cambridge  CB2  1ST 
England. 
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On  Che  Existence  of  Maximum  Polynomial  Degree  Nordsleck-Gear 
(k,p)  Methods  for  All  (k,p) 

Roy  Danchick  and  Mario  L.  Juncosa 

The  Rand  Corporation 

Santa  Monica,  California  90406 


Introduction 

This  paper  sketches  a  fundamental  theorem  on 
the  existence  ot  Nordsieck-Gear  (k,p)  methods  of 
maximal  polynomial  degree.   The  notation  (k,p) 
refers  to  the  number  of  scaled  derivatives  re- 
tained by  the  method  (k)  and  the  order  of  the 
differential  equation  (p) .   The  method's  poly- 
nomial degree  is  the  highest  degree  polynomial 
that  can  be  numerically  integrated  exactly;  given 
exact  starting  values. 

In  [1]  it  was  shown,  for  fixed  stepsize  and 
for  sufficiently  accurate  starting  values,  that  if 
a  certain  sequence  of  corrector-predictor  coeffi- 
cients was  well-defined,  then  the  Nordsieck-Gear 
(k,p)  methods  could  obtain  a  maximum  polynomial 
degree  of  k+1.   The  sequence  was  shown  to  be  well- 
defined  in  the  (k,l)  case  for  k=2,3,A,  and  5. 
These  results  led  to  the  conjecture  that  such  a 
well-defined  sequence  exists  for  all  (k,p).   The 
results  of  this  paper  prove  that  conjecture. 

Basic  Definitions 

I   is  the  (k+l)x(k+l)  identity  matrix 

,(n+l)    r,(n+l) 

*0    ,'-v~2, 


^"■i0!  *  ^«»«>^i>-40«<n+1! 

The  stability  coefficients  I    , I      . . . ,1     are 
chosen  so  that  the  characteristic  polynomial  of 
(I-rn+1)e^)Ak  is  (X-l)Xk.   The  polynomial  degree 


(n+1)  , 


0 


is  computed  from 


maximizing  coefficient  £ 
the  difference  equation 
SQ  =  0 

Sn+1=a-^n+1)e^)[ASn-{(k«)}], 


according  to 

£(n+l) 
0 


(1-e^AS  )/(k+l-eLvS  ), 
On        In 


(2) 


(3) 


(2)  following  from  a  recursion  argument  on  the 

—  k  — 

error  in  solving  the  ODE  y'(x)  =  (k+l)x  ,yn  =  0. 

We  now  sketch  the  proof  that  the  choice  of 
the  polynomial  degree  maximizing  coefficient 

£_     according  to  (3)  can  always  be  made. 


eQ  =  [1,0.0,. ...0]  , 


ex   =  [0,1,0,..,  0]  . 


Definitions 


A,  is  the  (k+l)x(k+l)  Pascal  triangle 


matrix  with  element  a.,  given  by 


•ij  - 


<;> 


if  0  <  i  <  j  <  k, 
if  0  <  j  <  i  <  k, 


(t1;1)]^^),  (k;l),...,(k;l)]T. 


Preliminary  Discussion 

The  Nordsieck-Gear  (k,l)  method  for  the 
numerical  solution  of  the  scalar  O.D.E., 

y'(x)  =  f(x,y(x)),  y(0)  =  yQ, 

for  fixed  stepsize,  h,  can  be  written  as 
„(0) 


n+1 


AY 


(1) 


I  is  the  kxk  identity  matrix, 


I   =  t^,^,...,^]  , 


e1-  [1,0, ...0] 


A,  is  lower  kxk  diagonal  submatrix  of  A,  , 

{(if)}=  [(kt1),(k^1),...,(k:1)]T. 

K  is  the  kxk  Jordan  canonical  form  companion 

matrix  to  the  polynomial  X  . 

P  is  the  kxk  permutation  matrix  of  columns 
of  the  identity  matrix  written  in  reverse  order 

V,  ,  is  the  Vandermonde  matrix  with  element 
k-1 

v.,  =  ij(i=0,l,2 k-l,j=0,l,2,...,k-l), 

_ij  j 

e   =  [1,1 1]  • 
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Theorem 

Maximum  Polynomial  Degree  Nordsieck-Gear 
(k,p)  Methods  exist  for  all  (k,p)  . 

Sketch  of  Proof 


rP+1, 


The  idea  behind  the  proof  is  to  reduce  the 

lower  kxk  submatrix  of  A,  (I-£     e  )  to  Jordan 

T 
canonical  form  leaving  e  AS   invariant  and  to 

show  that  this  term  is  non-positive.   Since  A, 

is  upper  triangular  the  problem  can  be  reduced 
by  one  in  dimension  by  defining  the  sequence. 


0  =  0, 

S  .   =  (l"-ee^)[AS  -{(^1)}j. 
n+1        1    n    1 


(A) 


Let  W  =  L,  Q,  A, S  .   Here  Q   is  a  unique  upper 
n    k  k  k  n         n 

— T 
triangular  matrix  with  first  row  equal  to  e..  .   It 

—  —  —  T 
takes  A^(I-2.e  )  into  the  form 

_ TT    _1  V-L1     _T 

\  =  \\(I-£ei)Qk    =I+K-{(i)}ei   •        (5) 


\=  (WVp^^V^ 


and  that  for  the  (k,p)  case 


{(k+i)}  =  (k+i)diag  [(p)>(p+i))__>)C).j 


k  -1  -1 

P     k-p 


This  reduces  the  general  (k,p;  case  to  the  (k-p,l) 
case. 

References 

(1)   Danchick,  R. ,  "On  the  Non-Equivalence  of 
Maximum  Polynomial  Nordsieck-Gear  and 
Classical  Method,"  Procedings  of  the  Confer- 
ence on  the  Numerical  Solution  of  Ordinary 
Differential  Equations  17,20  October  1972, 
University  of  Texas  at  Austin,  Lecture  Notes 
in  Mathematics,  Springer,  Vol.  362,  No.  VIII, 
pp.  92-106. 


B  is  similar  under  P  to  the  companion  matrix  of 

k 
the  polynomial  X  .   L  is  a  unique  lower 

-T 
triangular  matrix,  whose  first  row  is  e. ,  that 

takes  B  into  K  according  to 


K  ■  Wk1 


(6) 


Under  L,  Q  A,  the  S  sequence  goes  into 

Vi  =  K[vLkV(kt1)}J  • 


(7) 


Moreover 

e^W      =   ejx,  Q,  A,  S      =   e^Q,  A,  S      =   e.A,  S      =e^A,  S    .(8) 
In  1  kxk   k  n  lvk   kn  1T(  n        lien 


The  theorem  is  proved  if  it  can  be  shown 

A  sufficient 


that  efw  =  <  0  for  n  =  0,1,2, 
In   - 


condition  for  this  sequence  of  inequalities  to 
hold,  given  (4),  is  that 

LkV(1?)}i°  •  <9> 


This  is  accomplished  by  showing  that 

LkV(kl1)}  =  (k+1)LkVklg 


(10) 


and,  by  induction  on  k,  that 

w;1  -  °  ■ 


(11) 


passage  to  the  k  >  p  >  1  case  is  made 
by  showing  that  the  general  form  of  the  Q, 

associated  with  a  (k,p)  method  is 


23-2 


TWO  DEVICES  FOR  IMPROVING  THE  EFFICIENCY 
OF  STIFF  ODE  SOLVERS 

Peter  Alfeld 

Department  of  Mathematics 

University  of  Utah 

Salt  Lake  City,  Utah 


ABSTRACT 

The  nonlinear  systems  of  equations  that  a- 
rise  when  a  stiff  system  of  ordinary  differential 
equations  is  tackled  by  an  implicit  linear  multi- 
step  method  is  normally  solved  using  a  modified 
version  of  Newton's  method  in  which  the  Jacobian 
is  kept  constant  for  as  many  steps  as  possible. 
An  initial  approximation  is  obtained  by  applying 
a  predictor,  typically  an  explicit  linear  multi- 
step  method.   In  this  talk,  two  approaches  to 
improving  the  efficiency  of  the  above  procedure 
are  described. 

Firstly,  it  is  suggested  to  use  quasi-Newton 
methods  that  update  the  Jacobian  approximation 
at  each  iteration-step  for  the  solution  of  the 
nonlinear  system.   It  is  argued  that  the  most 
popular  such  method,  Broyden's  first  method,  is 
unsuitable  for  the  special  nonlinear  systems 
under  consideration,  and  that  instead  Broyden's 
second  method  should  be  used.   That  argument 
depends  very  critically  on  the  different  roles 
that  are  played  by  the  eigenvalues  (of  the 
Jacobian)  with  large  and  small,  respectively, 
absolute  values.   Computational  aspects  are  con- 
sidered, and  numerical  examples  are  given. 

Secondly,  it  is  suggested  to  improve  the 
initial  approximation  by  interpolating  correc- 
tions that  were  applied  to  previously  obtained 
initial  approximations.   This  device  is  motivated 
by  considering  the  test  equation  y'  =  X  .   Nu- 
merical examples  suggest  that  it  works  very  well. 


1.   INTRODUCTION 

When  an  implicit  linear  multistep  method 

is  applied  to  a  stiff  initial  value  problem 

y'    =    f(y)    ;    y(a)    =   n  e   Rm    ;    x  6    [a,b]    (2) 


Fn(W    =  yn+k'hBnf(Vk'Vk) 


k-1 

+     I     (a.y       -hB.f   ..)   =  0 
jtg       J    n+J        J   n+j' 


(3) 


has    to  be    solved  at   each    step   of   the   integration 
(1). 


2.      THE   USE   OF   QUASI-NEWTON  METHODS 

In   this    talk,    it    is  proposed  to   solve   the 
nonlinear    system   (3)    by  a   rank-one  quasi-Newton 
method   defined   by: 


1)      yn+k        given   by    some  prediction  operator. 
Let      r   =   0    . 


2)      Solve      B^slfj  =   -  F    (yf?J)     where      B[rJ 
n+k  n+k  n  yn+k  n+k 


frl 
is  an  approximation   to      F'(y    ,,) 

n     n+k 


3)      Let     y[lt1]   =   ylrl  +  s[*} 
'n+k  'n+k  n+k 


u[r]        =        (    Ir+llj.p    (    [r] 
n+k  n^yn+k      ;      n^yn+k; 

(4) 

[r+1]   m      [r]         ^n+k     n+kSn+kMVn+k; 
n+k  n+k       "  [r]   T      [r] 

(Sn+kj      Vn+k 


[r] 
(where     v   ,,       is   specified  below) 
n+k 


4)      Check  for   convergence.      If   not   satisfied 
let      r  =   r+1      and  go  back   to   step    2). 


then  a  nonlinear   system  of   equations 
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A   detailed   discussion  of    quasi-Newton  methods 
may   be    found    in    [1],     [3],     [A],     [6].      Among 
others,    the   following   properties  are   important 
in  the   present    context:      1)    Quasi-Newton  methods 
converge  superlinearly.      2)   The  update  of    the 
Jacobian   can   be   replaced   by   an  update   of    the 
inverse  Jacobian. 

Normally,   a  version  of  Newton's  method    is 
used    for   the  numerical   solution   of    (3).      The 
Jacobian  is  kept  constant    for  as  many    steps  as 
possible,    but  when    it    has    to  be   reevaluated,    or 
reestimated,    this   requires   additional  function 
evaluations.      In  a   quasi-Newton  method,    this 
expensive  procedure  is  replaced  by  a   cheap   up- 
date of   the  Jacobian  at   each   iteration  for    the 
solution  of    the  non-linear    system    (3)o      This    is 
more  efficient  as   it   requires  no  additional 
function  evaluations. 

Broyden    [3]    suggests    the   following  choices 
of   the  update    (4) : 

v[r]      =      g[r]       (Broyden's    first   method)  (5) 

n+k  n+k 

vIrl      =      (b      , )Tu    ,,     (Broyden's    second  method) 
n+k  n+k        n+k 

(6) 


Broyden's  first  method  is  generally  consid- 
ered the  better  general  purpose  method    (see 
[3],  [5]). 

However,  in  [1]  it  is  argued  that  for  the 
special  nonlinear  systems   under  consideration 
Broyden's  second  method  is  more  suitable. 

For  lack  of  space  the  argument  can  only  be 
outlined  here.   (A  full  description  may  be 
found  in  [1].)   It  is  based  on  considering  the 
different  roles  that  are  played  by  the  eigen- 
values of  the  Jacobian  F'   with  large  and  small, 

n 

respectively,  absolute  values.   The  argument  can 
be  summarized  in  three  points: 

1)  A  prediction  y  ,,   can  be  obtained  that  is 

r  ;n+k 

particularly  accurate  in  components  corre- 
sponding to  eigenvalues  of  large  absolute 
value. 

2)  Any  quasi-Newton  method  reduces  in  a  cer- 
tain sense  to  the  secant  method  applied  to 
each  component  (corresponding  to  one  of 
the  eigenvalues)  of  the  solution  of  (3). 
For  the  secant  method,  the  error  at  the 
r-th   stage  is  roughly  proportional  to  the 
previous  two  errors  and  inversely  propor- 
tional to  the  first  derivative  of  the 
function  under  investigation.   In  the  pre- 
sent context,  this  means  that  the  error  in 
the  component  corresponding  to  an  eigenvalue 

,  say,  is  roughly  proportional  to  the 

two  previous  errors  in  that  component  and 

Inversely  proportional  to  A,  . 


The  above  points  show  that  the  eigenvalues 

of   F'   with  large  absolute  value  have  little 
n 

importance  for  the  convergence  of  the  iteration 

for  the  solution  of  (3).   (They  do  matter  for 

stability).   The  key  point  is  the  following: 

3)   Broyden's  second  method  approximates  partic- 
ularly well  the  eigenvalues  of  small 
absolute  value.   More  precisely:   Broyden's 
first  method  minimizes  (over  all  rank-one 
updates)  a  sharp  bound  on  the  error  of  the 
eigenvalues,  whereas  Broyden's  second 
method  minimizes  a  similar  bound  on  the 
reciprocals   of  the  eigenvalues.   Since  these 
bounds  are  independent  of  the  eigenvalues, 
Broyden's  first  method  yields  small  relative 
errors  of  the  large  eigenvalues,  and  Broyden's 
second  method  yields  small  relative  errors  of 
the  small  eigenvalues  (large  and  small  in 
absolute  value) . 

In  [1],  numerical  examples  (taken  from 
[7])  are  given  that  confirm  that  quasi-Newton 
methods  are  strong  competitors  of  Newton's 
method,  and  that  indeed  the  update  (6)  performs 
better  than  the  update  (5)  (in  the  present 
special  context). 

Numerical  aspects  considered  in  [1]  include 
the  following: 

[r] 

1)  Rather  than  updating   B      in  (4),  we  can 

n+k 

fr]  -1 
update   (Bl, £)    >  thus  saving  an   LU- 

f actorization  at  each  step. 

2)  When  the  step-size  or  order  of  (1)  is 
changed,  then  it  is  a  viable  strategy  not 

to  change  the  Jacobian  approximation  at  all. 
The  basis  for  this  device  is  the  fact  that 
the  smaller  the  absolute  value  of  an  eigen- 
value, the  less  it  is  affected  by  such  a 
change. 


3.    THE  USE  OF  INTERPOLATION 

In  [2],  the  following  modification  of  the 
normal  prediction  process  is  described: 

y      given  by  an  explicit  linear       (7(i)) 


multistep  method 


'n+k     yn+k  +   n+k 


(7(ii)) 


y  ,,    obtained  by  applying  a  variant     (7(in)) 
n+k    c  .,  ,     ,  .  . 

of  Newton  s  method  or  a  quasi- 
Newton  method  to  (3). 


d  ,,   is  obtained  by  interpolating 
n+k 

dn+J  "  yn+j  "  yn+j  (J  <  k)  •   This  approach 

depends  on  d  ,.   being  a  smooth  function  of 

n+J 

x    .   Using  an  asymptotic  expansion  of  the 
n+j 
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.runout  ion  error  tor  investigating  this 
Ther.         ie  test  equation 

y'  ■  Vy  (8) 

nsidered.      It    turns  out    that    the  corrections 

d      ,       show   the  same  qualitative  behaviour  as 
n+k 

the  numerical   solution  of    (8)    by    (1). 

It    follows   that,    for   example,   Gear's   back- 
ward  differentiation  formulas    (see    [8])    are 
suitable   for    the  approach    (7).      On   the  other 
hand,    the  Trapezoidal   Rule  leads    to   oscillating 
global   errors,   and   thus   to  oscillating  corrections 

d    . .    .      However,    these   oscillations  occur   only  for 
n+j 

eigenvalues   of   large  absolute  value.      Since   we 
have   seen   in   section  2.    that   only   eigenvalues   of 
small  absolute  value  matter   for   convergence,    it 
follows   that    the  device    (7)    can  also  be  applied 
to   the  trapezoidal   rule.      Moreover,    particular 
predictors  can  be   designed   that   dampen    the  oscil- 
lations  in   the  corrections. 

In    [2]   numerical   examples  are  given    that   con- 
firm  the  above  statements,    and   the  viability  of 
the  approach   (7) . 
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Abstract  —  Several  computational  methods  based 
on  the  backward  differentiation  formulas  have  been 
proposed  for  the  numerical  solution  of  stiff  systems 
of  ordinary  differential  equations.   These  methods, 
however,  suffer  from  serious  difficulty  in  that  they 
are  unable  to  cope  with  the  stiff  systems  in  the 
presence  of  the  eigenvalues  of  the  Jacobian  which 
are  close  to  the  imaginary  axis.   This  paper  expli- 
cates this  phenomenon  from  a  point  of  view  of  struc- 
tural stability.   A  necessary  and  sufficient  condi- 
tion is  presented  for  structural  stability  for  n- 
dimensional  system  of  linear  ordinary  differential 
equations  which  is  used  as  a  test  problem.   This 
proves  that  the  region  of  stiff  stability  contains 
structurally  unstable  systems.   The  poor  performance 
of  the  algorithms  when  some  of  the  eigenvalues  of 
the  Jacobian  are  close  to  the  imaginary  axis  is 
caused  by  the  presence  of  structurally  unstable 
systems.   Furthermore,  a  reliability  of  testing  for 
stiff  systems  is  discussed  from  a  point  of  view  of 
topological  equivalence.   A  practical  problem 
arising  from  dynamic  security  assessment  for  the 
large-scale  interconnected  power  system  is  also 
discussed. 

1.  Introduction 

The  purpose  of  this  paper  is  to  demonstrate 
importance  and  usefulness  of  the  qualitative  theory 
of  differential  equations,  i.e.,  the  theory  of 
structural  stability,  in  the  design  of  numerical 
algorithms  for  stiff  differential  systems. 

2.  Structural  Stability  for  n-dim.  LODEs 


lity  for  n-dim.  LODEs  and  present  a  clear  proof  of 
this  result  based  on  the  topological  equivalence  of 
linear  flows  [7]. 

We  shall  begin  by  explaining  a  general  concept  of 
"structural  stability".   A  flow  $  on  a  topological 
space  M  is  a  continuous  map: 


(J):  R  x  M 
such  that 

(i)  <t>(0,  x) 
(ii)  <|>(t 


M 


"!' 


(t2,  x))  = 


>(t  +  t2,  x) 


for  x  £  M  and  t  ,  t„  £  R,  where  R  is  the  real  line. 
A  trajectory  of  4>  through  x  £  M  is  a  canonically 
ordered  set  {(J)(t,  x)  |  t  £  R} . 

Definition  1:   Two  flows  4>  and  i)j  on  M  are  said 
to  be  topologically  equivalent  if  there  exists  a 
homeomorphism  of  M  onto  itself  carrying  each  trajec- 
tory of  (f)  to  a  trajectory  of  \p   together  with  preser- 
ving orientation. 

For  linear  flows  Definition  1  is  shown  to  be  equi- 
valent to  the  following  definition: 

Definition  1'  :   Two  flows  <J>  and  i/j  on  M  are 
said  to  be  topologically  c-equivalent  for  c  >  0  if 
there  exists  a  homeomorphism: 


M  ■+  M 


such  that 


h  •  <Kct,  x)  =  <Kt,  h(x)),  x  £  M,  t  E  R. 


After  the  introduction  of  the  concept  of  structural 
stability  by  Andronov  and  Pontryagin,  many  studies 
have  been  devoted  to  the  problem  of  structural 
stability  for  dynamical  systems  in  the  field  of 
pure  mathematics  (e.g.,  [12]).   It  seems,  however, 
that  there  is  no  description  of  a  necessary  and 
sufficient  condition  for  n-dimensional  linear 
ordinary  differential  equations  (LODEs)  to  be 
structurally  stable.   The  purpose  of  this  section 
is  to  investigate  the  concept  of  structural  stabi- 
lity for  n-dim.  LODEs.   First,  the  concepts  and 

itions  pertaining  to  the  structural  stability, 
which  are  useful  In  the  following  discussions,  are 
summarized  briefly.   Secondly,  we  give  a  necessary 
and  sufficient  condition  for  the  structural  stabi- 


*  ThlH  work  was  supported  in  part  by  the  Ministry 
.pan,  under  firant  259002  . 


Let..V(M)  be  the  set  of  all  C  -vector  fields  on  M. 
A  C  -vector  field  f  £  V(M)  generates  a  flow. 

Definition  2:  A  nbhd  of  f  £  V(M)  is  any  subset 

N(f)  in  V(M)  which  contains  a  set  of  the  form:{  g  £ 

V(M)  |  | | g  -  f | |  <  6  }  f or  6  >  0 ,  where  | | • | |  is 
some  norm. 

Definition  3:   A  vector  field  f  £  V(M)  is 
structurally  stable  if  there  exists  a  nbhd  N(f)  of 
f  in  V(M)  such  that  for  every  g  £  N(f),  the  flows 
of  f  and  g  are  topologically  equivalent. 

Now  we  can  show  a  new  result  pertaining  to  n-dim. 
LODEs.   Let  us  consider  a  family  of  n-dim.  LODEs: 

x  =  Fx,    x  £  R°,    F  c  L(Rn)         (1) 

which  is  used  as  a  test  problem,  where  L(R  )  denotes 
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n  mat  t  i  ha  follow- 

ing theorem: 

Theorem  1:   The  n-dim.  LODEs  (P  is  structural- 
table  if  and  only  if 

Re  \±(7)   t   0,        I n, 

Re  ^.(")  denotes  the  real  part  of  the  eigen- 
values of  the  matrix. 

Remark  1:   It  seems  that  there  is  no  descrip- 
tion of  a  necessary  and  sufficient  condition  for 
the  n-dim.  LODEs  (1)  to  be  structurally  stable. 
The  structural  stability  for  n-dim.  ODEs  was  inves- 
tigated in  [11]  which  however  depends  on  the  "c   - 
homeoraorphism" . 


Example  1:   Consider  the  4-dim.  LODEs: 

x  =  AAA_1x,    x  E  R4,    t  (  [0,  20/r,]  (2) 

A  -  diagli,  -i,  -ct+  Lui,  -a  -    Im],    I  >  0 

-1 
1 
i 


■1 
1 

-i 
i  -i 


-i 

-i 

1 

-1 


x(0)  = 


The  eigenvalues 

Xl,2  "  t1' 


3,4 


-a  +  im 


Hence,  by  Theorem  1,  the  LODEs  (2)  is  structurally 
unstable.   We  can  find,  Jn  fact,  tests  examples 
where  the  BDF  of  order  A  to  6  for  the  4-dim.  LODEs 
(2)  come  to  unstable  [14], 


Proof  of  Theorem  1:   First,  we  note  that  the 
vector  field  of  the  n-dim.  LODEs  (l)p|s  f(x)  =  Fx, 
the  flow  is  specified  by  41  ( C ,  x)  =  e   x,  and  V(M) 
is  the  set  of  all  n  x  n  matrices,  which  we  denote 
by  L(Rn).   The  nbhd  of  F  e  L(Rn)  is  any  subset  N(F^ 
in  L(Rn)  which  contains  a  set  of  the  form:{G  c  L(R  ) 
5)  for  some  5  >  0  and  | | • |  j  uniform 


Lemma  1  [7]:   Let  ■i   and  fy   be  flows  on  R  gene- 
rated by  x  =  Fx,  x  e  R  ,  and  y  =  Gy,  y  e  R  ,  respec- 
tively.  Then,  the  two  flows  41  and  ty   are  topologica- 
ls equivalent  if  and  only  if  In(F)  =  In(G)  and  G* 
»  cF*,  where  In(-)  is  the  inertia  of  the  matrix,  c 
is  a  positive  constant,  and  F*  denotes  the  real 
Jordan  form  of  the  matrix  F  corresponding  to  the 
eigenvalue  with  zero  real  part. 

We  are  now  in  a  position  to  prove  Theorem  1: 
(Necessity)   Assume  that  the  n-dim.  LODEs  (1)  is 
structurally  stable  but  Re  >  (F)  =  0  for  some  i. 


Chat 


By  Definition  3  it  follows  that  there  exists  a  nbhd 
N(F)  of  Ft  L(Rn)  such  that  for  every  G  E  N(F)  the 
flows  of  F  and  G  are  topologically  equivalent.   By 
choosing  any  nbhd  N(F)  of  F,  however,  there  is  an 
G  e  N(F)  which  does  not  satisfy  the  condition  in 
Lemma  1.   This  contradicts  to  Lemma  1.   Thus  we 
conclude  that  Re  A  (F)  t   0,  1  -  1,  ...  ,n,  for  the 
structurally  stable  n-dim.  LODEs  (1). 
(Sufficiency)   Suppose  Re  A . (F)  4   0,  i  =  1,  . .  .  , 
n.   Then  there  exists  a  nbhd  N(F)  in  L(R  )  such 
that  for  any  G  E  N(F),  Re  A  (G)  i   0,  i  «  1,  . . .  , 
n,  and  In(F)  =  In(G).   Hence,  by  Lemma  1  it  follows 
that  the  flows  of  F  and  G  are  topologically  equi- 
valent. 

3.   Stiff  Stability  Contains  Structurally  Unstable 
Systems 

The  most  popular  methods  for  solving  stiff  diffe- 
rential systems  are  based  on  the  backward  differen- 
tiation formulas.   The  crucial  drawback  of  these 
methods  is  the  poor  stability  property  when  the 
Jacobian  matrix  has  eigenvalues  close  to  the  imagi- 
nary axis  (e.g.,  [1],  [2],  [9],  [13]).   A  reason 
of  this  difficulty  can  be  interpreted  as  follows. 

First,  in  view  of  Theorem  1  and  the  definition  of 
stiff  stability  ([5],  [8]),  we  have  the  following 
theorem: 

Theorem  2:   The  region  of  stiff  stability 
contains  structurally  unstable  systems. 


To  summarize:   the  implicit  BDF  of  order  1  to  6 
are  stiffly  stable,  so  that  by  Theorem  2,  structu- 
ral instability  is  possible  for  the  BDF  of  order 
1  to  6. 

4.   Testing  —  Topological  Equivalence 

An  ultimate  aim  in  discussing  stability  of  numeri- 
cal methods  is  to  determine  the  stability  of  the 
methods  for  nonlinear  systems: 


x  =  f(x) ,    x  c  R  ,   x(0) 


0 


(3) 


The  property  we  would  like  to  demand  from  a  method 
is  that,  for  a  given  step-length  and  order,  the 
global  error  remains  as  small  as  possible.   In 
order  to  forecast  the  behavior  of  a  method  on  a  non- 
linear system,  it  is  customary  to  consider  the  test 
equation: 

.1 


x  =  Ax,   x  e  R 


(4) 


for  all  A  in  some  set  including  the  set  of  eigen- 
values of  the  Jacobian  matrix  J  =  3f/3x.   This  test 
problem  is,  however,  too  simple  to  give  a  reliable 
indication  of  the  performance  for  stiff  systems 
unless  some  additional  conditions  are  satisfied. 
Little  has  been  rigorously  proved  concerning  error 
propagation  in  the  numerical  solution  of  nonlinear 
stiff  systems  [1].   There  does  not  exist  a  general 
theory  of  stability  of  numerical  methods  such  that 
the  stability  behavior  of  the  method  when  applied 
to  nonlinear  stiff  systems  can  be  determined  in  a 
systematic  manner.   What  are  conditions  under  which 
the  (local)  behavior  of  the  nonlinear  stiff  problem 
(3)  is  determined  by  the  solution  of  the  linearized 
system?   This  question  is  partially  answered  by  the 
topological  equivalence  Lemma: 

Lemma  2  [6]:   The  local  behavior  of  the  NODE 
(3)  near  the  singular  points  is  topologically  equi- 
valent to  the  behavior  of  the  linearization  of  (3) 
if  and  only  if  ReA  (J)  +   0,  i  -  1,  ...  ,  n. 


Mo 
the 


reover,  let  tt  ,  tt  ,  and  tt  denotes  the  number  of 
e  eigenvalues  of  the  matrix  J  with  positive,  zero, 
and  negative  real  parts,  respectively.   The  topolo- 
gical type  of  singular  points  is  determined  by  the 
intergers  tt  ,  tt  ,  and  by  the  behavior  of  the  phase 
trajectories  on  some  invariant  submanifold  whose 
dimension  is  equal  to  tt  .   When  the  linearized 
system  has  an  eigenvalue  with  real  part  zero,  know- 
ing about  the  behavior  of  the  linearization  of  (3) 
at  singular  points  gives  no  information  about  the 
behavior  of  (3)  itself  near  the  singular  points. 
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Example  2:   Let  us  consider  a  practical  problem 
arising  from  dynamic  security  assessment  for  the 
large-scale  interconnected  power  system.   A  mathema- 
tical model  for  the  power  system  is  described  by 


6. 

1 


-  ETC  .  -  D.6. 


E  A. .cos(6 . 


J 
=  1, 


ij 


(5) 


One  of  the  problems  of  dynamic  security  assessment 
is  to  estimate  critical  clearing  times.   Hence, 
we  must  know  the  behavior  of  the  solution  of  the 
second-order  NODEs  (5)  which  is  possibly  unstable 
due  to  contingency.   If  the  effects  of  damping 
torque  is  approximated  by  "uniform  damping",  i.e., 


D./M. 
1   x 


C,     1*1, 

then  the  NODEs  (5)  result  in 


n, 


f(x)  -  Ci, 


x  e  R 


n-1 


(6) 


The  local  behavior  of  the  NODEs  (6)  near  singular 
points  is  topologically  equivalent  to  the  behavior 
of  the  linearization  of  (6)  if  and  only  if 


.e   A. 

l 

(K0)    *   0, 

'   0             I  " 

0   = 

•Jo    "CI- 

1, 


2n-2, 


On  the  other  hand,  when  all  eigenvalues  of  J_  are 
real,  to  each  negative  eigenvalue  of  J.  there 
corresponds  to  either  two  negative  real  eigenvalues 
of  K_  or  to  a  pair  of  complex  conjugate  eigenvalue 
with  real  part  equal  to  -C/2.   Thus,  in  the  case  of 


without  damping,  i.e. 
Re  >i(KQ)  = 


0,  it  is  possible  that 


0. 


Hence,  we  need  a  full  nonlinear  system  to  obtain 
information  about  the  behavior  of  (6)  itself  near 
the  singular  points. 

5.   Concluding  Remarks 

The  qualitative  theory  of  differential  equations, 
i.e.,  the  theory  of  structural  stability  is  an 
important  factor  which  must  not  be  over  looked  in 
designing  algorithms  for  stiff  differential 
systems.   It  may  be  hoped  that  the  theory  of 
structural  stability  makes  it  possible  to  give  a 
theoretical  foundation  to  the  idea  of  P.  E.  Zadu- 
naisky  in  estimating  global  errors;  and  may  be 
also  useful  to  study  the  dangerous  property  ([1], 
[10])  of  numerical  methods  for  stiff  differential 
systems  with  eigenvalues  which  change  from  large 
negative  values  to  large  positive  values  during 
computation. 
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neral  theory  of  high  order  cyclic  methods  is 
given  in  /l/  and  /2/;  in  this  contribution  we  want 
to  present  some  practical  consequences.  The  theory 
is  based  upon  the  result  that  the  functional  ("A- 

'') 


A-. 
- 


°snp 


•  si  J  K 


■ 
1-1 


\i   o.£Rs 
1       J 


(1) 


ty 


which  depends  upon  a  real  (s.s)-matrix  S  with 
|AJ|i?:const . ,  jflH,  is  (the  most  adaquate)  stabili 
functional  for  a  very  wide  class  of  discretization 
methods,  namely  the  A-dependent  methods  ("A-methods") 
defined  by 


(h); 
o  o 


z.=Az.   +h?(x.  ,,z.    .,z.;h),  j=l(l)p  (2a) 


where  ;  (h),z.eRS,  h£(o,h  ]<l(o,b-a]   and 
o  oJ—  J 

s  s  **  *   s 

T:  a,b  "R"  XR  v  o,h  -*R   is  continuous  and  _,satisf  les 


Ril' 


,rd' 


a  Lipschitz-condition  w.r.t.the  2   and  3   argument. 

Frequently  it  is  convenient  to  write  (2a)  in  the 
equivalent  form: 

z  =;  (h);  Lz.=L'z.   +hJ(x.  .,z.  ,,z.;h)        (2b) 
o  o       j   j-1     j-1   j-1   j 

with  det  L  4  o  .  Almost  all  commonly  used  methods 

can  be  reduced  to  these  matrix  forms.  As  an  example 

consider  the  following  linear  2-cyclic  2-step 

method: 

11  111 

.  -   jr,.  ,  ■  h(S_f  _. 
1  2j-l  o'2j-2      2  2 

2     2 


V2j- 


1  o  2j-2 


2j  l*Zj-l   o'2j-^      Z  ^j 

' 2j+l  l'2j   oy2j-l      2  2  *l  1  2j   o  Zj-1 

.  ■■,=',  (h)  j=l(l)p. 

o  o   11 

Combining  the  linear  terms  with  y„.  and  y  . 

the  left  side  and  all  other  terms  on  the  right 

side,  we  obtain  form  (2b)  with 

1 


/    2j  o  2j-2 

J  I  B2£    ♦  ...  *B2f„.   / 

>.  .2*1        o  2-1/ 


5) 


Multiplication  with  L   yields  form  (2a). 

Similarly,  any  linear  M-cyclic  k-step  methods  can 
be  reduced  to  the  forms  (2a/b)  defining  s:=max(M,k) 
and 


'  (h):  =  (o,...,o,n  ,n  (h) , n.  .(h))  •« 

o  o  1         k-1 


z . 

J 


'  (yMj+k-s,yMj+k-s+l yMj+k-l 


(3) 


In  the  case  M*k  we  have  to  add  (k-M)  "trivial 
steps"  of  the  form  VM-  +  i=yM'+'  '  1=°(l)k-M~4 »  before 
reducing  to  form  (2b7,  tin  the  special  case  M=l  one 
thus  obtains  the  well-known  one-step  representation 
of  a  linear  k-step  method,  where  A  is  a  Frobenius 
matrix) . 

An  analysis  of  the  functional  (1)  yields  the  fol- 
lowing theorem  which  is  related  to  results  of 
Stetter  111,    pg.314,  and  Skeel  /5/: 

Theorem: (a)  Let  6 .  dR  for  all  he(o,h  1  have  the 

7 J  ° 

form  ,  t     m\ 

6o=0(hq+1);   6ychVq+4)(x.)t4(hq+V   j=l(l)p   (A) 

with   qefrl,    teR      independent   of  h   and   j,    and  y 

bounded  in  [a ,b] . 

(b)  Let  the  eigenvalues  u.  of  A  satisfy 

Wt-l,  |u.|<  1  ,   i=2(l)s.  (5) 

Then  i|£  [oJ  =$'(hq+1)   if  and  only  if  one  of  the 
following  conditions  is  satisfied: 

rank  (I-A,t)=s-1  (6a) 

j  j.  y(q+1)(x  KC,   C  indep.  of  h  and  j.  (6b) 
1-1        L1 

Assumption  (5)  can  be  weakened  to  the  case  of 
multiple  eigenvalues  at  p=l,  and  still  other  gener- 
alizations are  possible  (see  111). 

A  f 

As  ik  is  stability  functional  for  all  A-methods 

with  local  discretization  errors  6.,  the  theorem 
states,  in  particular,  under  which  conditions  a 
composite  ivethod  with  stages  of  order  q  converges 
with  order  (q+1).  The  decisive  conditions  (6a)  and 
(6b)  will  be  called  "order  conditions";  especially 
condition  (6a)  turns  out  to  be  basic  for  the  con- 
struction of  high  order  composite  methods. 

I.,  whdt  follows,  we  shall  use  (6a)  for  the  con- 
struction of  stiffly  stable,  cyclic  methods  with 
maxima l_order.  Before  doing  so,  however,  let  us 
apply  our  result  to  certain  block-implicit  methods 
in  order  to  give  an  idea  of  the  usefulness  and  the 
wide  scope  of  application  of  the  order  condition. 
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Consider  the  methods 


"Mj-M+1 


yMj-M+2 


/Mj 


v- 


.o  1 


.o  1 


.o  1 


> 

( 

MJ-2M+1 

MJ-2M+2 

+h 

Mj-M      / 

K 

oMj-M4 


B4f  • 

M  Mj 


?  2 

If  .   +...+S  f  . 
U  Mj-M     Pl  Mj 


^p0  Mj-M 


3Mf 
M  Mj 

L 


j-i 


with  j=l(l)p,  z  =(o, 


,o,n  )  dR  . 


+h   <2>(x.  „  ,z.  „  ,z.  ;h) 


M 


o   '      '  '  o 

These  one-step  methods  have  been  previously  studied 

by  Watts  and  Shampine  /9/  and  by  Bickart,  Burgess 

and  Sloate  IV,    Ibl. 

Let  us  assume,  that  all  stages  have  order  q  except 

the  last  one  which  is  supposed  to  have  order  (q+1); 

then  t=(c    •  ...  ;c  .,;o)   (c   .  :  error  const. 
q+1         q+1        q+1 

of  the  r   stage)  and  the  method  converges  with 
order  (q+1)  because  (6a)  is  satisfied. 


Another  application  of  the  order  condition  (6a)  is 
the  following:  If  we  construct  a  k-cyclic  k-step 
method  with  (unstable)  stages  of  order  (2k-l), 
determining  the  k  free  parameters  such  that  the 
method  is  stable  and  satisfies  (6a),  then  we  obtain 
Donelson  and  Hansen's  /4/  k-cyclic  k-step  methods 
of  maximum  order  2k.  Thus,  our  theorem  provides  a 
rather  simple  and  straightforward  approach  to  them, 
discarding  the  concept  of  "auxiliary  systems" 
introduced  in  I  hi. 

As  the  stability  regions  of  these  methods  are 
fairly  small  for  k>3,  we  may  ask,  whether  methods 
with  slightly  reduced  order  possibly  have  better 
stability  properties.  This  is  the  case  and,  in  fact, 
one  can  even  achieve  stif f _stability,  maintaining 
unusual  high  orders.  In  particular,  we  have  found 
stiffly  stable 

2-cyclic  2-step  methods  of  order  3 

2-cyclic  3-step  methods  of  order  4      .  . 

3-cyclic  3-step  methods  of  order  5 

4-cyclic  4-step  methods  of  order  6. 

The  i  h  methods  La  rcm.-irkable 

thei  ible,  Id  near 

pnumber 

-wn. 


We  now  consider  M-cyclic  k-st:p  methods  of  the  iorm 

z  =C(h);  lz.=Uz.   +h(BIF.+BI,F.  J  ,  j=l(l)p 
o  L  j   U  j-i 

k-V 


r      s 

with   F  :  =  (o,...,o,f  , . .  . , f ,  ,  )  dR  ;  s : =max(M,k; : 
o  o      •'-i 


j    Mj+k-s      Mj+k-1 

and  (s ,s)-matrizes  L,U,B  ,B  ,  where  B  and  B  have 

r 
a  similar  structure  as  L  and  U  and  contain  the  B, . 


Tendler,  Bickart  and  Picel  /8/  have  constructed 
stiffly  stable  methods  of  this  type  with_order  k 
for  the  special  case  B J.=C/.  In  the  general  case  one 
obtains  the  maximum  order  methods  mentioned  in  (7) 
they  were  constructed  in  the  following  way: 


Let  be  p.(i=l(l)s)  the  zeros  of  P (u) : =det (pL-U) 

and  (j.  (i=l(l)s)   those  of  P(y) :  =lim  H~MQ(y;H)  , 

1  H-«° 

where  H  =  Ah  and  Q  is  the  characteristic  polynomial 

Q(p;H):=  det (u (L-HBL)- (U+HBy) ) . 
The  order  q  of  the  stages  is  chosen  as  high  as 
possible  and  such  that  a  sufficient  number  of  free 
parameters  is  available  to  satisfy  the  following 
conditions : 

1.  |u.|<  1,   i=2(l)s    ("stability  at  H=o")   (8a) 

2.  |y.|<  1,   i  =  l(l)s    ("stability  at  H=°°")   (8b) 

and,  if  possible, 

3.  rank  (L-U,c)  =  s— 1    (order  condition)     (8c) 

,1        M   T    r 
c:=(c   .,...,c   ,)  ;  c  ,  :error  const,  of 
q+1'     q+1     q+1    th 

r   stage 
(8c)  is  obtained  from  (6a)  by  multiplication  with 
L  using  Lt=c. 

Each  method  that  satisfies  (8a-c)  is  then  testet 
on  stiff  stability  performing  the  mapping  of  the 
unit  circle  into  the  H-plane  by  Q(y;H)=o. 


Up  to  now,  a  general  existence  theory  of  stiffly 
stable,  cyclic  methods  with  maximum  order  doesn't 
exist.  Also  the  question  of  practical  performance 
and  implementation  have  not  yet  been  sufficiently 
investigated.  A  series  of  tests,  run  at  the  Nuclear 
Research  Center  in  Julich,  Germany,  furnished 
promising  results. 


From  consistency  we  have  always  \i„  *1 . 
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:der  the  non-stiff  system  of  ordinary  dif- 
ferential equations 

(1)  y'-f(x.y);  y(a)=n;  x£[a,b]j  f£Rs ;  s>l. 


.  r.e  the  grid  a=xr|<x.<x„< 


<x  =b  and 
N 


-.epsize  h.=x.-x._1,  i=1(1)N.  The  computatio- 
nal work  per  step  (i.e.  the  work  needed  to  compute 
an  approximation  y.   to  the  true  solution  y(x. ) 

of  (1)  when  all  y.,  j=C(1)i-1,  are  already 

J 
found)   is  smaller  when  a  constant  stepsize  (  h.=h 

*(b-a)/N,   i=1  1 )H  )  is  used.  When  variable  step- 
sizes  are  used  we  accept  much  more  work  per  step 
(needed  to  select  an  optimal  stepsize  and  to  change 

stepsize)  hoping  to  minimize  the  number  of 
steps  (i.e.  the  number  N  of  points  in  the  grid) 
according  to  the  prescribed  error  tolerance  e. 
Different  formulae  can  be  used  in  an  attempt  to 
optimize  the  stepsize  selection  further  (this  is 
connected  with  extra  computational  work  per  step 
too).  Such  methods  are  called  variable  stepsize 
variable  formula  methods  (VSVFM's).  The  only  li- 
near multistep  VSVFM's  used  in  practice  are  bas- 
ed on  the  Adams  formulae  which  have  many  excelent 
properties:    (i)  are  zero-stable  for  any  change 
of  the  stepsize  and  the  formula,  see  Gear  and 
Watanabe   [2];   (ii)  an  error  estimation  can 
easily  be  obtained,  see  Gear  [1],  Krogh  [4], 
Shampine  and  Gordon  [6];   (iii)  the  storage  re- 

>  and  the  computational  work  per  step  are 
smaller  than  those  of  the  other  linear  multistep 
methods.  However,  if  the  absolute  stability  requi- 
reme       Lag  the  numerical  integration  of  the 
particular  problem  are  dominant  over  the  accuracy 

see  Shampine  [5]  )  then  the  possi- 
bility of  using  large  stepsizes  will  be  restricted, 


because  these  methods  have  small  absolute  stabili- 
ty regions.  An  attempt  to  improve  their  performan- 
ce in   such  situations  has  been  done  in   [4]   and 
[6]  where  P.EC,  ..E  schemes  (which  have  better 
stability  properties,  see  the  plots  in   [6]  )   are 
adopted  instead  of  P  EC  E  schemes.  Some  linear 
multistep  formulae  with  large  absolute  stability 
regions  have  been  derived  by   Thomsen  and  Zlatev 
[7].  A  code  based  on  these  formulae  is  discussed 

in  this  paper. 

* 
Consider  the  following 

scheme : 


'VV  -  pkECk+iE 


k-1 

(2)  CAVH+(1AWh.VikVi! 

i=0 

(3)  Ck^wC^ 

k-1 

(U)yn+k=Vn+k-1  +  (1-\)yn+k-2+h.J:neikfn+i 

1=0 


+h6kkW 


(5)  Wf<wW- 


If  a  =ct  =1   then   (2)  -  (5)   represent  an 

Adams  P  EC   .E  scheme.  If  a   and  a      (k=3(1)l2) 

are  as  in  Table  1  then  the  absolute  stability  re- 
ft 
gions  for  the   (a  ,a  )-P  EC   ,E  schemes  are  larger 

than  those  for  the  corresponding  Adams  -  P  EC   .E 

schemes.  A  similar  (but  slightly  more  complicated 

than   (2)  -  (5)  )   scheme  can  be  found  for  k=2 

(see  [7]  ).  An  example  is  given  in  Fig.  1  where 

k=5  and  the  stability  formula  for  our  method  is 

plotted  by  a  bold  line,  the  region  for  the  Adams- 

P.EC^E  scheme  is  plotted  by  a  dotted  line  and  the 

region  for  the  Adams-P  EC  E  scheme  is  plotted  by 

dashes;  more  details  can  be  found  in   [7],  where 
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:he  plots  for  all  k   (k=2(1)l2)   are  given 


* 

k 

\ 

°k 

3 

1.36 

1.U0 

1+ 

1.87 

1 

90 

5 

1.90 

1 

95 

6 

0.55 

1 

50 

7 

1.70 

1 

90 

8 

1.50 

1 

90 

9 

0.50 

1 

80 

10 

-2.50 

1 

50 

11 

-0.60 

1 

80 

12 

-1.00 

1 

80 

ORDER  K=5 


Table  1 


fC*.  MIS 


In  the  code   (2)   and   (U)   are  expressed  in 
terms  of  divided  diffrences.  Define   (  n+k<N,  i>1 


!6-a)  gn+k,0=W    Ck,o"CkJ 


{6'b)  gn+k,i=(gn+k,i-rVk-1,i-1)/Un+k"Vk-i)-' 


C?,a)  r0,TXn+k  Vk-1'   ro,1=Vk  xn+k-2; 


(7-b)  ro,rro,iro,o-1(^1)/J'  0=2(1)^*1^ 

(7-c)  fo.j-'o.l'oj-i"-1^'  Ml*** 

(7-d)  riJ-<W^Wi)ri-1.J*Pi-1.J*V  W")Hl 

(T.e)r.  .=(x  -x  _  jr.  -  .+r.  -  .+1 ,  i-1  (1  )k-l ; 

j=1(1)k-i+1    in  (T.d)  and  (7.e); 

(8-a}  ^,k=akri,1  +  (1-ak)f:i,V  i=°d)k-1; 
(8'b)  Yi+1,k-Vi>2+(1-ak)fi,2i   *"«>**■ 


Then  the  (ak.\)  _  VSVFM        ie  based 
on  the  following  formulae: 

(9)   yr**,=  i*yn+k-1  +  (1-ak)Vk-2+.ZnYi,kgn+k,ii 

1=0 

10)   y        =i    /  „  +  M-ci    )v  ♦  I  y .      & 

;   'n+k     k*n+k-1         Vyn+k-2   .  iYi,kfen+k,i 


*       * 

En+k,0' 


Fig.  1 


(the  differences   g  ,,  .   in  the  sum  in  (10) 

can  be  found  by  the  use  of   (6.b)  with  g  replaced 
* 

ty  g  )• 

In  our  implementation  many  ideas  from  the 
most  efficient  codes   (  [1],   [3],   [4],   [6]  ) 

were  used.  The  code  can  be  used  with  different  va- 

*  * 

lues  of   a    and   a ,  .   By  the  choice   a  =a  =1 , 

k=2(1)l2,  an  Adams  VSVFM  may  be  obtained.  This 

allows  us  to  make  comparisons  in  which  only  the 

use  of  different  formulae  in  the  VSVFM,  but  not 

the  stepsize  and  order  selection  strategy,  will 

have  influence  on  the  results. 

Many  problems  from  [3],   [4]   and   [6] 

were  run.  As  an  example  consider  problem  12  in 

[4]   with   B^Uv,  B2=3v,   B,=-v,   B^O.OOlv, 

v=1(1)12,   e-10  ,   [a,b]=[0,50].  The  numbers 

of  function  evaluations   (ND)  are  given  in 

Table  2. 

The  results  of  the  comparison  between  the 

* 
Adams  VSVFM 's  and  the   (a,  ,a,  )   VSVFM 's  can  be 


k*  k' 


summarized  as  follows: 


(i)   The   (a  ,a  )   VSVFM 's   require  more 
storage  and  computational  work  per  step.  Moreover 
the  error  constants  of  (2)  and  (k)      are  larger 
than  those  of  the  corresponding  Adams  methods. 
Therefore  it  is  not  efficient  to  use  these  methods 
if  the  accuracy  requirements  are  dominant.  But 
note  that  very  often  the  number  of  function  eva- 
luations was  not  larger   (  a  careful  comparison 
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V 

Adams  VSVFM 

(ak,a*)  VSVFM 

1 

289 

227 

2 

1*59 

3U9 

3 

629 

un 

I* 

801 

597 

5 

871 

719 

6 

1139 

839 

7 

1309 

963 

8 

mi 

1085 

9 

1577 

1210 

10 

1815 

1332 

11 

198U 

1U52 

12 

2155 

1 

Table  2 


showed  that  this  is  so  because  higher  orders  were 
selected  by  the  (a^,o  )  methods  ). 

(ii)   If  the  absolute  stability  requirements 
are  dominant  then  the  (a  ,a  )  methods  will  nor- 
mally use  a  smaller  number  of  steps  in  the  inte- 
gration. 

(iii)  The  zero-stability  properties  of  the 

a,  ,ol  )  methods  are  the  same  as  those  of  the 
k  k 

Adams  methods,  see  Zlatev  [8]. 

(iv)  A  one  step  predictor-corrector  scheme 
(  the  Euler  formula  as  a  predictor  and  the  tra- 
pezoidal rule  as  a  corrector  )   is  used  in  the 
starting  stage  and  in  the  situations  where  the 
order  should  be  dropped  to  one  (as  an  example 
consider  the  case  where  discontinuities  in  the 
right -hand  side  of  (1)  appear  for  some  values 
of   x  in  [a,b]),  see  more  details  in  [6]. 

Our  main  conclusion  is  that  (av»av)  methods 
:ar.  s^::essfully  be  used  instead  of  the  Adams 
ods  in  the  case  where  only  the  absolute  sta- 
-  S   requirements  restrict  the  selection  of 
large  stepsizes. 
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1.   PROBLEM  STATEMENT  -  Since  the  exact 
solution  of  a  system  of  m  first  order  ordinary 
differential  equations 


y*  =  f(y,t),  y(tQ)  = 


(1) 


can  be  numerically  approximated  at  only  discrete 
points,  the  comparison  of  analytic  and  numerical 
solutions  to  determine  accuracy  and  stability  can 


only  be  made  at  discrete  values  t 


0, ,n. 


Since  most  numerical  methods  attempt  to  keep  the 

stepsize  h.  =  t.  „-t.  constant  for  as  many  steps  as 

l    i+1   l 
practical  subject  to  accuracy  and  stability 

constraints,  the  study  of  local  accuracy  and 

stability  can  be  carried  out  for  constant  h.  In 

particular,  define  the  analytic  k-step  solution 


sequence  Y( t 


-  wvk+1.>,  y<t0-1f2>'--'*(V) 


and  the  numerical  k-step  solution  sequence  to  be 

Yn  =  (yn-k+1'  Vk+2'  — '  V  Where  yi  i3,the 
numerical  approximation  of  the  solution  y(t  )  of 

(1)  and  each  of  y(t  ),  y  is  an  m  vector.  This 

paper  will  describe  a  software  package  of 

interconnected  programs  and  subroutines  which  can 

be  used  to  compare  the  stability  properties  of 

Y  generated  by  a  particular  multistep  method  with 

the  properties  of  Y(t  )  for  a  particular  nonlinear 

differential  function  f(y,t).   Fig.  1  shows  a  flow 

diagram  of  this  package. 

The  usual  analysis  of  the  stability  of  ( 1 ) 

involves  two  concepts:  the  equilibrium  solution  and 

the  domain  of  attraction.  Define  the  equilibrium 

solution  z(t)  to  be  the  analytic  solution  of  ( 1 ) 

where  z  =  z(t  )  is  chosen  such  that 

z'(tQ)  =  f(zQ,  tQ)  =  0.      (2) 

For  autonomous  equations,  the  solution  is  z(t)  = 
z  .   The  domain  of  attraction  can  be  found  if  there 
exists  a  function  v(y)  >  0  for  y  i   0,  v(0)  =  0, 
such  that  dv(y)/dt  i  0  in  a  doaain  D  about  the 
equilibrium  z(t).   This  region  Is  the  domain  of 
attraction  and,  as  t  increases,  the  difference 
between  any  two  solutions  of  (1)  generated  by 
different  initial  values  w  and  x  within  D 
decreases  with  respect  to  v,  i.e. 
Iv(w(t+h))-v(x(t+h))|  <|v(w(t))-v(x(t))|.   If  the 
function  v(y)  =  y  Qy  is  used,  where  Q  is  a  positive 
definite  matrix  chosen  by  known  techniques  [1], 


then  v(y)  is  <y,y>  for  the  scalar  product  <u,v> 

ufcQv. 


In  fig.  1 ,  box  2  represents  a  routine  which  solves 
for  the  m(m+1)/2  unique  values  of  Q  by  dividing 
f(y,t)  into  its  linear  and  nonlinear  parts  so  that 

f(y,t)  =  By  +  g(y,t) 

Then  a  symmetric  (or  Hermitian)  Q  is  found  such 
that 

BfcQ  +  QB  =  I 

where  I  is  the  m  by  m  identity  matrix.  While  this 
routine  has  not  yet  been  written,  the  linearization 
can  be  automated  by  finding  the  approximate 
Jacobian  B  at  t  =  0  using  column  by  column  finite 
differencing,  requiring  m  evaluations  of  f(y,t). 
Box  3  of  fig.  1  represents  a  routine  which, 
given  an  initial  guess  z  inside  D  will  find  either 
z  ,  the  equilibrium  at  t  =  0,  or  else  z  •,  the 
equilibrium  solution  in  a  plane  specified  by 
holding  m-2  values  constant,  if  such  a  restriction 
does  not  produce  algebraic  contradictions.   This 
routine  is  also  being  written  and  is  discussed  in 
section  H   below.   In  the  sequel,  z  will  represent 
either  z  or  z  *  as  appropriate. 

2.   COMPUTATION  OF  STABILITY  REGIONS  -  A 
concept  similar  to  the  analytic  domain  of 
attraction  is  sought  for  the  analytic  and  numeric 
solution  sequences.   Dahlquist  [2]  has  devised  a 
function 


VG,K,n(V  '  ill  |l 


gij<yn-k+i' 


yn-k+j> 


for  some  positive  definite  matrix  G  =  (g,^,  where 
<u,v>  is  the  scalar  product  above.  LinigSr  and 
Odeh  [3]  have  considered  how  to  choose  G  to  obtain 
the  nonlinear  equivalent  of  A-stabllity,  and 
Dahlquist  [1]  mentions  in  a  recent  report  that  a 
program  to  compute  G  has  been  written  at  Stanford. 
This  is  represented  by  box  1  of  fig.  1 . 

To  facilitate  computational  and  analytic 
manipulations  with  respect  to  multistep  methods,  it 
has  been  shown  [2,5]  that  the  numerical  solution 
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usinj  k-step  multistcp  formulas 

0  =  *   ay   ,+hb.f(y   t   .  ) 
r=0  i'n-i   1  'n-i  n-i 

stability  properties  similar  to  numerical 
solutions  using  one  leg  k-step  formulas 

k 


has  power  series  x(t)  =  jj.  a^  ,  y(t) 
b.t  defined  recursively  by 


i=0  i 


0  Vn-i+  hf(i=0  V.-i'i=0  Vn-i5 


if  .:  ',  b.  =  1.   Software  is  being  developed  which, 
given  a  point  within  the  resulting  domain  finds  the 
boundary  of  the  region  D1  where  VQ  k  n(Y.,)  - 
V  k   h^V  -  °'  which  is  the  discrete  analog  of 
the  domain  of  attraction  -here  v'Cy.)  <.  0.   This 
can  be  used  to  compute  D1  for  either  the  analytic 
or  the  numerical  solution  sequence,  provided  the 
analytic  sequences  Y(t  )  and  Y(t  )  are  available. 
The  k-step  numerical  solution  sequence  is  also 
computed  using  y  and  k-1  exact  (analytic)  previous 
points  to  compute  y   using  the  one  leg  k-step 
method  under  consideration.   Box  1  of  fig.  1 
represents  a  compiler  module  which  transforms  a 
specification  of  the  differential  function  f(y,t) 
into  two  FORTRAN  subroutines  S  and  F.   S(T,Y0) 
returns  the  correct  solution  vector  at  t  =  T  given 
that  Y0  is  the  initial  value  at  t  =  0,  and  using 
techniques  described  in  section  3  below.   F(T,Y) 
returns  the  derivative  of  Y  at  time  t  =  T  given 
that  y(t)  =  Y.   Box  5  of  fig.  1  represents  working 
software  which  uses  brute  force  search  and  then 

bisection  on  up  to  80  lines  radiating  from  z  along 
a  2-dimensional  plane,  storing  the  result  in  an 
array  STAR. 

The  stability  region  D1 '  such  that  V    h^Yn^ 
<.  Vf  can  also  be  computed.   V»  is  max  V   *  f Y) 
over  all  Y  on  the  boundary  of  D1.   This  corresponds 
to  all  initial  values  that  generate  solutions  that 
remain  inside  D1  after  k  steps.   Box  6  of  fig.  1 
represents  the  working  software  that  computes  and 
stores  up  to  80  points  of  the  boundary  of  D' ' . 
These  points  are  computed  using  bisection  between 
z  and  the  boundary  of  D'  stored  in  array  STAR.   It 
ha3  been  shown  [5]  that,  if  a  domain  of  attraction 
D  exists  for  v(y)  =  y  Qy  that  is  convex,  then  the 
boundaries  of  D'  and  of  D''  can  be  computed  using 
rays  from  some  point  within  D' ' . 

3.   AUTOMATED  ANALYTIC  SOLUTIONS  -  Since  the 
analytic  solution  is  required  for  k-1  steps  before 

yQ,  means  of  obtaining  y(t    ), ,  y(t  )  are 

needed.   Translation  software,  box  1  of  fig.  1,  is 
being  developed  to  manipulate  the  specification  of 
a  system  of  m  non-linear  first  order  ordinary 
differential  equations  to  produce  a  FORTRAN 
subroutine  which,  given  y  at  t  =  0  and  some  value 
t,  outputs  y(t)  using  power  series  solution 
techniques  provided  t  is  in  the  radius  of 
convergence  of  the  power  series.   For  example, 
x'  =  -x-2y+x2sin(t) 
y1  =  5x-y+y/(t+JO 


("ai-r2bi-1  < 


V  b0  =  V 
i-1 


ln(V  jio.ajbi-i-j)/i- 


bi  '-   C5a1-1-(t0+3)/(t0+«)b^1)/l. 

The  regions  D,  D' ,  and  D' '  for  the  analytic 
solution  with  q^  =  37  A4,  q   =  16/44,  q   =  q?1  = 
3/1*14 ,  G  =  I,  h  =  .1,  and  k  =  2   appear  in  fig.  2. 

1.   EQUILIBRIUM  -  Although  any 'point  inside 
D' •  can  be  used  to  generate  the  boundary  of  D'  and 
D1 ' ,  if  the  equilibrium  (2)  is  used  then  good 
results  for  the  analytic  case  are  insured. 
Software  corresponding  to  box  3  of  fig.  1  is  being 
written  to  handle  computation  of  z  ,  possibly  using 
an  n  dimensional  chord  method  starting  with  a  value 
that  is  known  to  be  inside  D' ' .   This  is  an 
application  of  the  software  described  in  secton  2 
since  a  message  that  V(Y  )  -  V(YQ)  >  0  can  be 
generated  to  indicate  that  the  point  being  used  as 
the  origin  of  the  rays  used  to  find  the  region 
boundary  is  itself  not  in  the  domain  of  attraction. 
A  special  case  occurs  when,  for  an  n-dimensional 
system,  n-2  initial  values  are  held  constant  and 
the  equilibrium  initial  value  for  the  other  2 
variables  is  found.   This  is  both  computationally 
simpler  and  more  practical  since  only  2-dimensional 

regions  can  be  displayed  on  most  graphics 
equipment.   Since  this  routine  is  not  available  in 
the  current  software  package,  a  z  is  picked  using 
knowledge  of  the  function  f(y,t)  being 
investigated. 

5.  GRAPHICAL  OUTPUT  -  All  of  the  currently  " 
working  software  has  been  gathered  as  subroutines 
to  an  interactive  control  MAIN  program  which  uses  a 
simple  command  language  (the  first  n  positive 
integers)  to  perform  the  n  available  functions. 

For  those  subroutines  not  yet  available,  such  as 
the  computation  of  the  equilibrium,  a  default  (0) 
is  provided  and  execution  of  the  command  will  allow 
the  user  to  input  a  usable  value.   The  0  command 
produces  a  list  of  available  commands;  each  command 
prompts  the  user  in  its  use. 

The  graphics  package,  using  TEKTRONIX  PL0T10 
software  connected  to  a  Control  Data  CYBER  172, 
prints  the  left,  right,  lower,  and  upper  bounds  of 
the  figure  to  be  drawn,  then  allows  the  user  to 
override  these  limits.   This  allows  all  figures  to 
be  drawn  to  the  same  scale  for  comparison.   A 
hardcopy  device  can  be  used  to  save  the  figures. 
The  example  in  section  6  was  drawn  using  this 
package. 

6.  EXAMPLE  -  Consider  the  analysis  of  the 
complex  equation  y'  =  Ly  where  L  =  (In  yQ)/h,  which 
has  two  equilibria,  a  stable  one  at  z  =  0  and  an 
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unstable  one 
distance  from 


r  1,  for  v(y)  =  y  y,  the  Euclidean 
I.   The  analytic  solution 


and  G 

sequence  is  y.  i  y    ,  so  the  analytic  domain  of 
attraction  and  stability  region  are  both  bounded  by 
the  unit  circle,  and  the  unstable  equilibrium  is  on 
the  boundary.   Fig.  3  shows  D'  about  1  for  the 
numerical  sequences  generated  by  the  Euler 
predictor  (dotted),  the  Euler  predictor  with  one 
backward  Euler  corrector  (dash),  and  the  2-step 
Adams  explicit  formula  (solid).   Only  implicit 
formulas  that  are  stable  at  ™  have  domains  of 
attraction  that  include  z   =  0.   In  this  case, 
there  is  no  stability  region  D' '  since  the  circle 
of  radius  V*  about  y  =  0  is  not  inside  D' . 
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Fig.  2  -  Nonlinear  time  dependent  example 
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SECOND  DERIVATIVE  EX- 
TENDED BACKWARD  DIFF- 
ERENTIATION FORMULAE 
FOR  THE  NUMERICAL  IN- 
TEGRATION OF  STIFF 
SYSTEMS 

J .  R .  Cash 
Department  of  Mathema- 
tics, Imperial  College, 
South  Kensington, 
London  S.W.7,  England 


In  two  recent  papers  the  present 
author  has  introduced  a  class  of  extended 
backward  differentiation  formulae  suitable 
for  the  approximate  numerical  integration 
of  stiff  systems  of  ordinary  differential 
equations  of  the  form 


&=  f(x.y).  y(xo) 


(l.D 


Extended  backward  differentiation  formulae 
take  the  general  form 


L     5, 


j-o 


y   .  -   h(6.  f   .  « 
n  +  j         k  n+k 


5k+lf n+k+1 


(1.2) 


where  the  a.'s  and  B.'s  are  constants, 
J          J 

y     is  the  approximate  numerical  solu- 
n  ♦  j 

tion  obtained  at  x   .  and  f     ■ 
n  +  j       n  +  j 

£  ( x   .,y   .).   These  formulae  are  simi- 
•*•   n  +  j   n  +  j 

lar  in  structure  to  the  advanced  multi- 

step  methods  of  Williams  and  de  Hoog  and 

to  a  subclass  of  the  composite  multistep 

methods  considered  by  Bickart  and  his  co- 

workers.  However,  the  way  in  which   the 

required  solution  of  (1.2)  is  generated 

is  entirely  different  from  that  consid- 

ered by  others.   In  an  earlier  paper  it 

was  shown  that  if  these  extended  formulae 

are  used  in  a  certain  precisely  defined 

'predictor-corrector'  mode,  with  a  con- 

ventional backward  differentiation  formu- 

la being  used  as  a  'predictor'  and  (1.2) 

being  used  as  the  corrector,  then  it  is 

possible  to  derive  L-stable  schemes   of 

orders  up  to  4  and  A(a)-stable  schemes 

of  orders  up  to  9.   Numerical  experi- 

ments with  the  resulting  algorithm  on  a 

wide  class  of  stiff  test  problems  have 

indicated  that  it  is  often  competitive 

with  Hindmarsh's  Gear  Rev. 3  which  is 

widely  accepted  as  being  one  of  the  most 

efficient  general  purpose  codes  for  the 

integration  of  stiff  systems  currently 

available.   Numerical  experience  has  also 

shown  that  when  integrating  stiff  systems 

of  equations  with  jacobians  having  large 

complex  eigenvalues  lying  very  close  to 

the  imaginary  axis  using  extended  backward 

differentiation  formulae  it  is 

desirable  that  all  of  the  basic  integra- 
tion formulae  should  be  highly  stable  (not 
just  A  -stable  but  A(a)-stable  with  a 

close  to  it/2).   This  is  because  an  algo- 
rithm employing  A(a)-stable  formulae  with 
a  small  sometimes  selects  a  high  order 
formula  with  a  small  value  of  h  to  pre- 
serve stability  rather  than  correctly 
choosing  a  more  stable  lower  order  formula 
with  a  larger  value  of  h.   The  requirement 
that  all  of  our  formulae  should  be  very 
highly  stable  leads  us  naturally  to  con- 
sidering extended  backward  differentiation 
formulae  employing  second  derivatives. 
These  take  the  general  form 


yn+k  "  yn+k-l 
k+1 


h  /    B.f   .  +  h2y, g   , 
t- j  n+j       'k6n+k 

j-0 


(1.3) 


,  _  df  (x,y(x)) 

where  g   ,  =  -r— 
6n+k    dx 

n  +  k 

,y=yn+k 

Note  that  we  choose  a.  =    0,  j  <  k-1  to 
ensure  stability  when  h  =  0. 

Following  the  general  strategy  adop- 
ted for  the  implementation  of  (1.2)  we  use 
our  formulae  in  a  predictor-corrector  mode 
with  (1.3)  being  the  corrector  and  a  con- 
ventional second  derivative  formula  of  the 
form 

k 

y   , -y   ,  ,  =  h  /   B-f   .+h2y,g   ,   (1.4) 
'n+k  'n+k-1       c_  Mj  n+j     'k6n+k 

j-0 

being  used  as  the  'predictor'.   This  ap- 
roach  may  be  summarised  in  a  step  by  step 
form  in  the  following  way: 

(1)   Compute  a  quantity  y   ,      as  the 
v  t       •'■'n  +  k 

solution  of  the  conventional  linear  k- 

s  tep  f ormu la 
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:'a.k"h3kfn  +  k"h2Yk8n  +  k    =    yn+k-l 


k-1 

+  h  y  e .  f  ^ . 

j=0 


(1.5a) 


(2)   Compute  y   ,  ,  as  the  solution  of 

v  7n+k+l 

v      -ha  f      -1i2yc       ="7 
'n+k+1    k  n  +  k  +  1    YkBn+k+l    yn  +  k 


It  has  been  shown  by  Enright  that 
for  any  integer  k  satisfying  1  <  k  <  7  it 
is  possible  to  derive  stiffly  stable 
schemes  of  the  form  (1.4)  having  orders 
k+2.   It  is  straightforward  to  verify 
that  for  all  positive  integers  k  it  is 
possible  to  derive  formulae  of  the  form 
(1.3)  having  order  k  +  3.   We  now  state  the. 
following  lemma  regarding  the  order  of 
accuracy  of  the  procedure  defined  by 
steps  (1)  -  (4)  above. 


k-2 

+   he,     ,f(x      .  ,y("£)    +   2Z  h6-f   .-.,    (1.5b) 
k-1         n+k       n+k  * j    n+j+1 

j-0 

(3)  Compute  ?n  +  k+1  =  f  ( XR+k+1  ,7^^  ) 

(4)  Compute  y   ,  from  (1.3)  written  in 
the  form 

yn  +  k"hVn  +  k-h^kgn  +  k  =  yn+k-l 


k-1 

Z 

j=0 


hg.f   .+h6,   ,f   ,   , 
j  n+j    k+1  n+k+1 


(1.5c) 


We  note  that  in  implementing  steps  (1), 
(2)  and  (4)  it  is  necessary  in  general  to 
solve  systems  of  nonlinear  algebraic 
equations  for  the  required  solutions.   In 
each  case  these  are  solved  using  a  modi- 
fied form  of  Newton  iteration  iterated  to 
convergence.   If  the  usual  procedure  of 

82f 
neglecting  the  term   - — 7  is  adopted,  the 

coefficient  matrices  associated  with 
steps  (1),  (2)  and  (4)  respectively  are 


he.  J  „.  -  h- 
k  n  +  k 


YkJn+k 


I  -  hB,  J   ,  ,  -  h2y,  J2  ,  , 
k  n+k+1      'k  n+k+1 

and 

I  -  h?,  J   ,  -  h2y,  J2  , 
k  n+k      'k  n+k 


where  J   .  is  some  suitabl 
n+k 

tion  to  the  relevant  Jacob 

x   ,  .   Exhaustive  comparis 
n  +  k                   r 

earlier  for  the  case  where 

shown  that  if,  with  step  ( 

coefficient  matrix  (A)  rat 

then  the  iteration  scheme 

variably  converges  if  the 

scheme  in  step  (1)  converg 

experiments  seem  to  indica 

also  the  case  when  y       +    0. 

it  has  been  found  that  ite 

(2)  normally  converges  if 

matrix  (A)  is  used.   Thus 

only  coefficient  matrix  wh 

compute  and  LU  decompose  i 

leads  to  a  substantial  sav 

.1  ■•  f  f  0  r  t  . 

(A) 
(B) 

(C) 

e  approxima- 

ian  matrix  at 
ons  conducted 

Y,  =  0  have 
k 

4 ) ,  we  use  the 

her  than  (C)  , 

in  step  4  in- 

i  te  rat  ion 

e  s .   Numer  ic  a  1 

te  that  this  is 

Fur  thermore 

ration  s  cheme 
the  coefficient 
in  practice  the 
ich  we  need  to 
s  (A)  and  this 
ing  in  compu t  a- 


L  e  mm  a  1 

Given  that 

(1)  Formula  (1.5a)  is  of  order  k+2 

(2)  Formula  (1.3)  is  of  order  k+3 

(3)  The  implicit  algebraic  equations 

,     r ■     ■         — (n)     ,  — (n)  , 

defining  y   ,  and  y   ,  ,  are  solved  usins 
n+k       n+k+1 

an  iteration  scheme  iterated  to  conver- 
gence, then  scheme  (1.5c)  has  order  k+3. 

This  lemma  is  easy  to  verify  by 
standard  arguments. 


Having  defined  our  basic  formulae 

we  are  now  in  a  position  to  examine  their 

regions  of  absolute  stability.   We  do 

this  by  applying  them  to  the  usual  scalar 

test  equation  y  =Ay,  where  X  is  a  complex 

constant  with  negative  real  part.   It  was 

found  that  these  algorithms  are  very 

highly  stable  for  1  <  k  <  6  and  the  cor- 

responding regions  of  A (a ) -s t ab i 1 i t y  are 

given  in  Table  1.   Note  in  particular 

that  these  formulae  are  L-stable  for  or- 

ders up  to  and  including  6  whereas  En- 

right's  conventional  second  derivative 

formulae  are  L-stable  only  up  to  order  4. 

If  we  consider  still  higher  values  of  k 

we  derive  A(a)-stable  formulae  with  the 

angle  a    decreasing  to  zero  as  k  is  in- 

creased.  However,  we  have  not  investiga- 

ted such  formulae  since,  as  stated  pre- 

viously, we  are  interested  in  deriving  a 

package  where  all  of  the  formulae  are 

very  highly  stable.   Furthermore,  when 

solving  stiff  systems  of  equations  it  may 

not  be  desirable  to  have  too  many  differ- 

ent order  formulae  as  quite  often  a 

change  in  order  results  in  the  coefficient 

matrix  of  the  Newton  scheme  having  to  be 

re-evaluated  and  LU  decomposed  and  this 

is,  of  course,  generally  very  expensive. 

The  exhaustive  testing  of  a  new 

algorithm  is  a  long  and  complicated  busi- 

ness and  at  the  present  time  we  are  un- 

able to  draw  any  firm  conclusions  con- 

cerning the  general  efficiency  of  our 

algorithm.   However,  preliminary  results 

obtained  using  a  fixed  order  (and  some- 

times a  fixed  step)  seem  to  indicate  that 

the  formulae  discussed  in  the  present 

paper  are  promising,  especially  when  com- 

pared with  conventional  second  derivative 

formulae.   It  is  hoped  that  further  re- 

search, particularly  into  choosing  the 
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order  anJ  sccpsize  ot'  the  particular  for- 
mula being  used,  will  lead  to  the  produc- 

D  efficient  code  incorporating 
the  algorithms  described  in  this  note. 

TABLE  1 


Order   Region  of  A ( a ) -s t ab i 1 i t y 
,o 


90 


90 


90 


89' 


87" 


83v 
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ABSTRACT 

In  this  paper  a  new,  computer  approach  to  the 
study  of  the  interactions  of  particles  with 
differing  masses  is  applied  to  the  study  of 
planetary  type  evolution.   The  formulation 
contains  an  inherent  self-reorganization  property 
in  which  particles  self-stratify  in  accordance 
with  their  masses.   Computer  examples  are 
described  and  discussed. 

1.  INTRODUCTION 

Recently  [4] ,  a  new  computer  approach  was  devel- 
oped for  modelling  the  interactions  of  particles 
with  differing  masses.   In  this  approach,  matter 
is  approximated  by  finite  sets  of  particles, 
each  of  which  is  treated  as  an  aggregate  of 
molecules,  and  the  active  forces  are  appro- 
priately rescaled.   Inherent  in  the  formulation 
is  a  natural,  self -reorganization  property,  in 
which  the  particles  self-stratify  in  accordance 
with  their  masses.   After  a  precise  mathe- 
matical and  physical  extension  of  the  method, 
so  as  to  include  both  long-range  and  short- 
range  forces,  we  will  apply  it  to  the  study 
of  planetary-type  evolution. 

2.  BASIC  MATHEMATICAL  DEFINITIONS  AND 
FORMULAS 


For  positive  time  step  At,  let  t 


kAt, 


0,1,2 


For  i  =  1,2,3, . . . ,N, 


let  particle  P.  have  mass  m.  and  at  time 


t,  let  P. 
k      j 


be  located  at  r.  ,  =  (x.  ,  ,y.  ,  ) , 
i,k     i,k  i,k 


have  velocity  v.   = 
i,k 


(v.  ,   ,v.     ) ,  and 
i,k,x   i,k,y 


have  acceleration  a .  ,  =  (a .  ,   ,a.  ,   ). 
i,k     i ,k,x  i ,k,y 

Let  position,  velocity  and  acceleration  be 

related  by  the  "leap-frog"  formulas 

([3] ,    p-     107): 

■*  .    At   -* 

Vi,l/2  =  vi,o  +Tai,o 


i,k+l/2 


i,k-l/2 


(At) 


i,k+l 


r      i  + 


(At)    v 


i,k+l/2' 


(2.1) 

a.    , ,    k  =    1,2,. 
i,k 

.     (2.2) 

0,      k   =   0,1,2.. 

(2.3) 

If  F.  .  is  the  force  acting  on  P.  at  time  t  , 


where  I 


l,k 


(F.  .   ,F.  ,   ),  then  we  assume  that 
i,k,x   i,k,y 


■  and  acceleration  are  related  by 


Fi,k 


i,k" 


(2.4) 


Once  an  exact  structure  is  given  to  F .  ,  ,  the 

i,k 

motion  of  each  particle  will  be  determined 

recursively  and  explicitly  by  (2.1)- (2.4)  from 

prescribed  initial  data.   In  the  present  paper, 

we  will  want  F .    to  incorporate  both  short-range 

and  long-range  components,  and  this  will  be 
implemented  as  follows 

the  distance  between  P.  and  P 


At  time  t,  ,  let  r.  .  ,  be 
k       13  ,k 


Let  G. . 
3        13 
(coefficient  of  molecular-type  attraction),  H. . 

(coefficient  of  molecular-type  repulsion) , 


13 


(exponent  of  molecular -type  attraction),  a. . 
(exponent  of  molecular-type  repulsion)  and  G* . 

(gravitational  constant)  be  constants  determined 

by  P.  and  P.,  subject  to  the  constraints  (see 
1      3' 

[6]):   G.  .  >  0,  H.  .  >  0,  a..  >  B.  .  >  2,  G*  .  >  0. 
1]  -    _13  -  _  13  -  13  -     13  - 

Then  the  force  (F;  ,_  ..  ,F^  u  tr)  exerted  on  P .  by 


i,k,x'  i,k,y 


P .  is  defined  by 


-G. 


F. 


H.  . 


i  /k,x 


■ij,k 


13 


r.  .    13 
13  ,k 


G*.      (m.m.)  (x.  ',  -  x. 
J-3       1-  3    J-.k    3>' 

ij ,k  J        rij,k 


(2.5) 


i  ,k,y 


JJ_ 


13, k 


-G. 


JJ_ 


13  ,k 


JJ_ 


r.  .    13 
13, k 


(m.m. ) (y.  . 
1  3    i,k 


y3,k> 


(2.6) 


•ij,k 


The  total  force  (F .  ,   ,F.  .   )  on  P.  due  to  all 
1 ,k,x  1 ,k,y      1 

the  other  (N  -  1)  particles  is  given  by 


N 


i  ,k,x 


I»< 


3=1 


i ,k,x  '   i,k,y 


N 

j=i  i'k'y 

3Vi 


(2.7) 


The  formulation  (2.1)-(2.7)  is  explicit  and 
economical  though  nonconservative.   Conservation 
of  energy  and  momenta  can  be  achieved  [3] ,  but 
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the  rule 

light-side  particle: 


,  -v  1.001  v.  , 
i,k  i,k 


(3.1) 


=1  Fi».  l. 


only  through  an  implicit,  less  economical 
approach.   Throughout,  the  time  step  to  be  used 

in  (2.1)-(2.3)  will  always  be  it  =  10~  and  a 
comprehensive  FORTRAN  program  for  implementation 
of  (2.1)-(2.7)  is  given  in  the  Appendix  of  [5]. 
It  should  be  noted  that  the  way  in  which  (2.1)- 
(2.7)  will  be  applied  lends  itself  directly  to 
parallel  computation  also  [2] . 

3.  BASIC  PHYSICAL  ASSUMPTIONS  AND  DEFINITIONS 

We  will  consider  a  system  of  137  particles,  so 
that  N  =  137.   This  parameter  was  determined 
solely  by  economic  constraints.   The  initial 
particle  positions  will  always  be  those  shown 
in  Figure  1. 

Next,  while  in  motion  we  will  require  a  rule 
for  determining  the  physical  state  of  each 
particle.   Such  matters  can  be  exceedingly 
complex  (1)  so  that,  for  the  present,  we 
will  use,  as  in  [5] ,  the  following 
intuitive  notion.   We  will  use 
"temperature"  as  a  phenomenon  of  a  particle's 
"local"  velocity,  that  is,  its  velocity 
relative  to  neighboring  particles.   Thus, 
when  a  particle  is  rotating  within  a  large 
system,  the  gross  system  velocities  should 
have  no  effect  on  the  particle's  temperature. 

As  a  last  consideration,  we  will  allow  for 
radiation  into  and  out  of  the  system.   This 
will  be  accomplished  as  follows.   Consider 
the  vertical  strip  regions 

x_^(k  +  1)y;  y>0, 

k  =  0,±1,±2,±3,...   . 

In  each  strip,  the  particles  with  maximum  and 
minimum  y  coordinates  are  called  outer  particles. 
An  outer  particle  will  be  called  a  light-side_ 
particle,  that  is,  it  faces  a  sun,  if  y.  .  >_y.. 

Otherwise,  it  is  called  a  dark-side  particle. 
Light-side  particles  will  receive  radiant  heat 
while  dark-side  particles  will  emanate  radiant 
heat.   This  is  implemented  as  follows.   Every 
k*  steps,  outer  particle  velocities  are  reset  by 


dark-side  particle: 


EXAMPLES 


v.  ,  ♦  0.9955  v 
i,k  i,k 


A  variety  of  examples  were  run  with  various 
combinations  of  parameter  choices  selected  from 
6  =  4,5;  E  =  5,10;  k*  =  50,100;  D  =  2.1,2.3; 
Y  =  0.1,1.0,2.0;  and  G*.  =  0.01,0.001.   From 

these  we  will  first  describe  two,  in  both  of  which 
the  parameter  choices  are  6=4,  e  =  10,  D  =  2.3, 
k*  =  100,  y  =  li  and  G*.  =  0.001. 

The  results  of  the  variety  of  other  examples  run 
can  only  be  summarized  easily  as  follows.   Only 
minor  changes  in  input  parameters  often  resulted 
in  dramatic  changes  in  dynamical  behavior  [4] . 
This  is,  of  course,  consistent  with  the  diversity 
one  observes  among  the  planets  and  moons  of  the 
solar  system. 

Other  computations,  aimed  at  duplicating  the 
development  of  galaxy  arms,  all  ended  in  failure. 
The  systems  invariably  self-reorganized  into  one 
or  more  relatively  elliptic  or  circular  subsystems. 
This  suggests  to  us  the  strong  possibility  that 
more  than  gravitation  is  involved  in  galaxy  arm 
formation. 

Finally,  note  that  initial  calculations  with  a 
239  particle  configuration  were  begun,  but 
these  had  to  be  discontinued  due  to  a  lack  of 
funds. 
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Abstract.   Arbitrary  parameters  for  (s,p-l(p))  Runge-Kutta  procedures 
should  be  selected  so  that  resulting  codes  yield  reliable  results 
efficiently.   Shampine  suggests  as  a  compromise  a  process  for  selecting 
parameters  which  attempts  to  optimize  efficiency  subject  to  a 
constraint  designed  to  provide  reliability.   This  process  is  adapted 
to  choose  parameters  for  procedures  with   p  =  5,6,   and  some 
comparisons  with  known  procedures  are  given. 


1 .   Introduction 

Currently  (s ,p-l (p  )  )-procedures 
are  the  most  efficient  Runge-Kutta 
processes  available.   Pairs  of  formulas 
of  successive  orders  of  accuracy  provide, 
for  example,  an  approximation  of  order 
p-1   and  an  error  estimate  (EE)  of  order 
p   to  the  local  error  (LE).   It  is  known 
that  families  of  these  procedures  may 
be  represented  in  terms  of  arbitrary 
parameters,  yet  there  is  little  agreement 
on  how  they  should  be  selected. 

Production  codes  are  implemented 
so  that  some  function  of  EE  is  bounded  by 
a  user  specified  tolerance.   A  basic 
assumption  in  using  such  a  code  is  that 
the  behaviour  of  the  global  error  is 
reflected  by  that  of  the  local  error.   In 
-.howing  that  this  occurs  for  various 
strategies,  Shampine  £3]  needs  to  assume 
that  the  actual  local  error  is  bounded  by 
the  specified  tolerance.   Accordingly, 
parameters  should  be  selected  so  that  EE 
reflects  the  behaviour  of  LE.   A  second 
more  prevalent  criterion  for  selection  is 
that  of  minimizing  coefficients  of  the 
pi  iiu;ipal  local  truncation  error  in  order 
• '-  increase  efficiency.   To  enhance  both 

tcy  and  efficiency  Shampine  [2] 
in'.  1  compromise ,  a  constrained 


optimization  of  the  parameters.   This 
process  is  modified  below  in  order  to 
make  the  computation  feasible  for  higher 
order  formulas. 

Thus  we  propose  to  derive 
reliable  (|LE-EE|  is  small),  efficient 
(|EE|  is  small),  accurate  (|le|  is  small) 
procedures.   Because  the  optimization  can- 
not be  viewed  as  minimizing  these 
quantities,  the  resulting  formulas  cannot 
be  considered  optimal.   While  potentially 
useful  formulas  may  be  derived,  the 
analysis  is  more  appropriate  for 
indicating  which  procedures  should  be 
avoided. 

2.   Selection  Criteria 

For  elementary  differentials 

(D  . }   evaluated  at  the  current  approx- 
qi 

imation,  error  expressions  in  the  next 
step  are 


LE 


-  hP 


N 

I 
S  T 


P-1T 


s  hq  r  t^d 


i=l  pi   pi    q=p+l   i=l  Q1 


and 


N 


N 

00  c 

+   S  hq  £ 
i=l  Pi   Pi  q=p+l   i=l 


!        1 

EE=hP  S  TP   D 


(Tp-1-TP  )D  . 
qi    qi  Q1 


32-1 


where  _['     and  _TP  are  vectors  of 
nunerica]  conatanta  determined  i>\  the 

■ethodfl  of  orders   p-1   and   p   respert- 
i\cl\.   While  for   h   small,  there  will  be 
little  "contamination"  of  the  error 
ea timet e,  efficient  solutions  for 
moderate  tolerances  will  require  moderate 
to  Large  stepsizes.   Hence,  the  contam- 
ination may  be  substantial,  for  example, 

mv  coefficients  {r        }   are  zero,  or 
pi 
i  i  t  lit-  magnitude  of   Tp   is  substantially 

Larger  than  that  of  T^~ 

These  comments  indicate  that  the 

reliability  and  efficiency  of  a  procedure 

may  be  assessed  by  considering  the 

parameters 


a  =  max  |  T^1  |  , 
i     P1 


I  T?~  _   .  |  =  max  |  Tp7  |  ,   q^p  , 


q-li 


qi 


y      max | Tp~    |       =   max 

TP 

<1       £         qx                    i         qi 

It13:1-/. 

[     qx         qi 

"q           4                TP"1 

> 

qi 

z      =    number    of    coeffici< 

q 

jnts 

q>p 


q>p  j 


JP 


-1 


=  0 


qi 


Following  the  intent  of  Shampine's 
proposal,  new  procedures  are  derived 
to  minimize   a   subject  to  making 

ep+l'   p+l»  Tp+1   sma11  <<  T  if  Possible) 

and   z   =  0  .   (Even  this  optimization 

P 
may  not  yield  reliable  procedures,  for 

example,  if   Tp ,   =  T^  .  for  some 


qi 


qi 


indicies. ) 


3.   Derivation  and  Comparison 

A  four-parameter  family  of 
(6,4(5))  procedures  is  known  [5].   If  the 
nodes  are  confined  to   [0,1]   and 


>6±  *» 


a      is   minimized    if      c, 


m  i±£E 


5 


A  choice  of   c -   so  that 
5 


minimi zea   6,,   and  a  further  choice  of 
Cg   minimizes   ^5  •   The  procedure  V3  has 

nodes  (0,-,^—,^-,-^-,       QQV  )  . 

This    is    contrasted   to    other    procedures : 
I    -    Fehlberg    [l],Vl    [4],  V2    [5~|.       The    fact 


Procedure    z 


5 


P 


^6 


Fl 

F2 
V2 
V2 
V3 


0  2. 08 (-3)  1.67  .20  1.9  2 

0  1.28(-3)  2.59  .55  .46 

0  8.57(-4)  3.77  .67  .36 

5  6.25(-4)  4.44  1.00  .62 


0 


1.87(-3)      1.00        .33      1.16 


Table    1:       Parameters    for    (6,4(5)) 
procedures 


Procedure      FCN  STEPS  MAX    ERR  FRAC    DEC 

Fl           134925  21970  1.3  .002 

F2           120406  19541  10.4  .022 

V2           110803  17903  10.6  .039 

T 

V2      Disaster  for   y  =  -y 

V3     147757   24157      1.8       .002 

Table  2:   DETEST  for  ( 6,4( 5 ) )-procedures 

that  V3  does  not  perform  as  well  as  Fl  on 
DETEST  indicates  the  deficiencies  of  this 
analysis . 

To  obtain  an  (8,5(6))  procedure 
with   j3_  <  1   is  possible  only  for 
a  Si   9.  2  (-4).   As  such  a  procedure  was 
unlikely  to  be  as  efficient  as  DVERK, 
others  were  considered.   Without  con- 
straining $- ,        a   is  minimized  by  the 

,     ,n     4   8   20  9  n   1  ,  ■>      .. 
nodes  (0 ,—,—  ,—  ,— ,\  f—,\)      to  the 

value  1.98(-4);  however   3   =  6.15  .   As 
a  more  reliable  alternate  to  DVERK  we 
suggest  the  procedure  (V3)  with  nodes 

rn  1  10-2/10  5V10  4  ,    1    -n 
(0'"o"'— H '  i5~'Tjl'iT,1)   ' 

Observe  that  the  results  of 
Table  4  are  supportive  of  the  indicators 
of  Table  3.   For   p  =  7,8,   similar  com- 
parisons are  not  so  consistent:   while 
results  from  DETEST  indicate  procedures  of 
[5]  are  reliable  and  efficient,  the 
parameters   3,Yj5,   do  not  adequately 
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distinguish  them  from  other  known 
procedures . 


Procedure 

z6 

a 

P7 

^7 

57 

F 

14 

4.b2(-4) 

2.07 

1.00 

3.  50 

DVERK 

5 

4.62(-4) 

4.  61 

.91 

3.  50 

VI 

0 

1.69(-3) 

.90 

.15 

1.40 

V2 

5 

4.62(-4) 

1.43 

.69 

8.01 

V3 

0 

3.67(-4) 

1.54 

.60 

2.85 

Table  3:   Parameters  for  (8,5(6)) 
procedures 


Procedure   FCN 

STEPS 

MAX  ERR 

FRAC  DEC 

F 

75091 

8913 

22.3 

.063 

DVERK 

73206 

8  603 

15.3 

.134 

VI 

9  2139 

10995 

2.5 

.00  2 

V2 

80282 

9568 

10.  7 

.022 

V3 

76184 

9012 

4.1 

.022 

TABLE  4:   DETEST  for  (8 , 5( 6) )-procedures 


In  conclusion,  we  can  only  advise  that 
procedures  with   z   =f  0,   or  any  of 
.£>  Xt   .§   relatively  large  should  be 
avoided. 
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A  COMPARISON  BETWEEN  SEVERAL  PIECEWISE  CONTINUOUS 
ONE  STEP  INTEGRATION  TECHNIQUES 

J.  P.  HENNART  AND  R.  ENGLAND 


Universidad  Nacional  Aut 

I  I  MAS  (Institute  for  Research  in  Appl 

Apartado  Postal  20-726,  Mexic 

In  a  series  of  papers  by  the  first  author  [ 1 -3] 
a  class  of  one-step  schemes  has  been  developed 
which  is  particularly  well  adapted  to  the  numerical 
integration  of  stiff  systems  of  ODE'S.  These 
schemes  lead  to  piecewise  continuous  approximations 
in  time  with  the  added  possibility  of  exponential 
fitting,  and  provide  therefore  a  very  general  frame_ 
work  for  the  development  of  finite  elements  in  time 
in  connection  with  parabolic  evolution  problems 
[A, 5]  .  In  this  paper,  details  of  some  of  these 
schemes  are  given,  for  cases  which  do  not  require 
second  derivative  evaluation,  and  they  are  compared 
with  similar  alternative  schemes  providing  piece- 
wise  polynomial  solutions  on  the  one  hand  [6,7],  or 
exponential  fitting  on  the  other  [8-10]. 

Given  a  system  of  n  ODE'S  in  autonomous  form: 

y  =  f  ly(t)),  (1) 

"0   \.  ^ 


onoma  de  Mexico 

ied  Mathematics  and  Systems) 

o  20,  D.  F.  (Mexico) 

the  unknown  parameters  y.   ,  r=1 


,  s=l,...,q-1,  being  finally  e 
pl 

(t 


iple  collocation  at  t=t.  and  t; 

1 


y(i) 

fori 

and 

V(S)  (t.+1)=f(s-1)(y   (t.+J),  s=1, 
o-pq     i+l   %      %pq   i+l 


.,P-1, 

iminat 
:ti+1: 
..P-1, 


and 
ed  by 


(6. a) 


,q-1,  (6.b) 


with  v   (t.)  known  either  from  th 
tions  \i=0)  or  from  integration 
interval  [  t .  ,  ,  t .]  .  After  such  a  su 
(3)  becomes: 


e 
ove 


1 n 1 1 1a 
r  the 
bsti tu 


t . 


i+l 


*i+i  =^i  +  I  %q(lrh^'t] 


)dt 


1  condi- 
prev  ious 
t  ion.Eq. 

,  (7) 


t. 
1 


one  gets  after  in  teg rat  ion  over  [  t .  ,  t .  , . ]  : 

1       1  +1 


y  Ui+1)=y(t.)+     f 


i+l 


f(y(t))   dt. 


(2) 


In    the    integral  on    the    righthand    side,    y(t)     is    then 

approximated    by  its    two-point    Taylor    interpolant 

y      (t)  ,    lead  ing  to 

■\-pq 


i+1 


i+1     =    V;    +     [     '"     f     (Vnn(t))dt> 

1+1      *'      j         %    o,pq 


(3) 


i.e.    a    system  of   nonlinear   equations    in    the 


unknowns  y, 


+  1 


Eq.  (7)  defines  a  class  of  discrete 


one-step  ^  integration  methods,  depending  on  the 
quadrature  formula  which  is  used  to  approximate 
the  integral,  in  addition  to  the  choice  of  p  and 
Assuming  that  y    exists  and  is  unique,  each  of 
its  components^alpends  on  (p+q)  parameters  which 
are  the  corresponding  components , of  y>r^, 
r=0,...,p-1  (if  p  >  1),  and  of  y|*{  ,^=0,  .  .  .  ,q-1 
( if  q  >   1)  .  Exp  I icitly 


Wl) 


p-1 

I 

r=0 


.Cr) 


t- 


t-t . 


v 


r! 


where  p,  q  are  nonnegative  integers,  while  each 

component  of  y    belongs  to  the  1  inear  space 
-X/Pq 

S  ={tnexpU  t)  |n=0 N   .  ;m=l  ,  .  .  .  ,M;  IN  =p+q}, 

m  m-1  m 

m 

(A) 

where  the  X  's  are  given  distinct  real  numbers, 
one  of  them  usually  being  zero,  v    is  then 
chosen  to  satisfy  the  p+q  interpolation  condi- 
tions 


y(')  (t.)  =  y.(r>,  r=0 p., 

'pq   «   ;i 


and 
(sj 


.(s) 


'  (t   )  =  y  !  .  S=Q q-1 

^jjq     i+l     'J+l 


(5. a) 


(5-b) 


q-1 

s=0  ^ 


(s) 


t-t. 


i+1 


u^  ID  =  0   ,  r,£=Q, 


6r£  ,  r,£=0,...,p-1 


.q- 


and 

v^(0)  =  0 
vf  (1) 


■s;: 


,  s,£=0,...,p-1 
,  s,£=Q,.. .,q-1 


(8) 


where  h=  t.,.-t.  and  the  functions  u  ,  v  e  S 

.  ,    i+l   1  r'   s 

sat  1 sfy 


,  (9- a) 

,  (9.b) 

,  19. c) 

,  (9-d) 


so  that  (5)  is  automatically  satisfied.  Then,  when 
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some  k  point  quadrature  formula  is  used,  Eq .  (7) 
becomes 

k 
y.  ,  =  y.  +  h  E  w,,f  (y   (t.  -M3»h)) 


with  0  <  0.  <••  •<  0,  <  1 ,  that  is 

k 


(10) 


i+1 


y- 


«i,-rfl^lr)".(v^!^l:!  ».««"*»  ft*- 


s=ov 


(ii) 


Restricting  attention  to  the  case  p,q  <2  with 


srn 


,  the  space  of  polynomials  of  degree  less 


thaR  8r  equal  to  (p+q-1),  (11)  leads  to  a  class  of 
(k+2)  stage  Runge-Kutta  schemes  which  can  be 
characterized  by  the  following  array  in  Butcher's 
notation  (and  with  p=q=2) : 


W  Vo((V 


Yo10!1  W 


(12) 


u  (ek)   wlVo(0k) 


wkvo(0k)   Vl(0k) 


„,  •  •  •   "k 

The  special  cases  0.  =  0  and  (or)  0,  =  1 ,  and  those 

with  p  and  (or)  q  <  2,  are  easily  derived  from  this 

general  class  by  noting  the  properties  (9)  of  the 

f unc t  ions  u  ,  v  . 
r   s 

A  number  of  particular  cases  are  of  interest,  among 
which  the  following  may  be  mentioned. 
p=1 ,  q=0,  and  any  quadrature  rule  exact  for  a  cons- 
tant  integrand,  gives  the  forward  Euler  method. 
p=0,  q=1  ,  and  any  quadrature  rule  exact  for  a  cons_ 
tant   integrand,  gives  the  backward  Euler  method. 
p=2,  q=0,  and  any  two-point  quadrature  rule  exact 
for  a  linear  integrand,  gives  a  3  stage,  second 
order,  explicit  Runge-Kutta  method.  The  family 
with  O.=0  gives  all  the  2  stage  methods  of  that 
type. 

p=q=1 ,  and  any  two-point  quadrature  rule  exact  for 
a  linear  integrand,  gives  a  2  stage,  second  order, 
implicit  Runge-Kutta  method.  The  family  with  0.=O 
consists  of  semi -expl ic i t  methods,  with  Butcher's 
array: 


(13) 


0 

0 

0 

■1 

1 

2 

>-k 

1 

which  include  the  well  known  cases:©-,  ■  y  (implicit 
methodj        I  (trapezium  metnod) . 


p=0,  q=2,  and  any  two-point  quadrature  rule  exact 
for  a  linear  integrand,  gives  a  3  stage,  second 
order,  implicit  Runge-Kutta  method.  The  family 
with  0-=  1  consists  of  2  stage  methods,  but  these 
show  no  apparent  advantage,  in  terms  of  effort  or 
stability,  over  methods  of  third  order  with 
p=1,  q=2. 

p=2 ,  q=1 ,  and  any  three-point  quadrature   rule 
exact  for  a  quadratic  integrand,  gives  a  4  stage, 
third  order,  implicit  Runge-Kutta  method.  The  case 
with  0.=  0,  0_=  2/3  is  a  2  stage,  semi-explicit, 
Radau  method,  of  higher  order  than  those  given  by 
(13),  but  of  limited  interest  because  it  has  a 
bounded  stability  region. 

p=1 ,  q=2,  and  anv  three-point  quadrature  rule 
exact  for  a  quadratic   integrand,  gives  a  4  stage, 
third  order,  implicit  Runge-Kutta  method.  The 
family  with  0,=  0,0,=  1  consists  of  3  stage 
methods,  amongst  which  should  be  mentioned  the 
Radau  and  Lobatto  methods  obtained  with 
0,  =  1/3,  1/2,  2/3  having  Butcher's  arrays: 


1/3 

5/12 

-1/12 

1 

3/4 

1/4 

3/4 

1/4 

(20) 

0 

0   0   0 

0 

0   0   0 

1/2 

1/8  1/2-1/8 

2/3 

2/9  2/3  -2/9 

1 

1/6  2/3  1/6 

1 

1/4  3/4  0 

1/6  2/3  1/6 

1/4  3/4  0 

the  first  showing  possible  advantages  both  for 
storage  and  convergence  during  solution  of  the 
nonlinear  equations  (7)  at  each  step. 
p=q=2,  and  any  three-point  quadrature  rule  exact 
for  a  cubic  integrand,  gives  a  5  stage,  fourth 
order,  implicit  Runge-Kutta  method.  The  case  with 
0=0  ,0,  =1  is  a  3  stage,  Lobatto  method  with 
Butcher's  array: 


(21) 


0 

0    0 

0 

1/2 

5/24  1/3 

-1/24 

1 

1/6  2/3 

1/6 

1/6  2/3 

1/6 

an  A-stable  method  which  has  been  given  by  other 
authors. 

At  this  point,  it  is  interesting  to  point  out  that 
the  "optimal"  Runge-Kutta  schemes  based  on  Gauss- 
Legendre  quadrature  [6]  do  not  appear  as  special 
cases  of  the  above  formalism.  These  optimal  schemes 
are  more  finite  difference  oriented  in  the  sense 
that  they  achieve  superconvergence  at  the  mesh 
points.  However,  as  is  well  known,  they  lose  accu- 
racy between  them.  Our  schemes,  which  were  studied 
in  detail  in  [  1]  for  general  p  and  q,  do  not  pres- 
ent that  disadvantage:  order  p  +  q  is  achieved  not 
only  at  the  mesh  points  but  also  between  them, 
under  the  only  assumption  that  the  quadrature  rule 
is  exact  when  appl  ied  to  the  members  of  S.  The 
basic  idea  for  developing  such  a  formalism  was  to 
provide  a  piecewise  continuous  representation  in 
time  that  could  match  in  a  fully  consistent  way 
the  high  order  accuracies  offered  by  finite  ele- 
ments in  space  for  parabolic  evolution  problems. 
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It  turns  out  that  in  the  polynomial   case,  the 
stability  properties  on  I  >  depend  on  p  and  q  (if 
the  quadrature  scheme  is  of  sufficient  accuracy): 
namely  the  schemes  with  p=»q  are  A-stable,  while  the 
schemes  with  q=p+l .  p+2  are  strongly  A-stable. 

In  the  general  case  where  exponential  basis 
functions  are  present  (Eq.  (M),  it  is  possible 
to  show  [5]  that  the  above  schemes  when  applied  to 
the  test  equation  y=  \y  are  equivalent  to  the  multj_ 
point  Pade  approximant  schemes  interpolating  the 

exponential  N   times  at  x=h^m«  m=' M>  and  at 

least  once  atmx=0.  These  schemes  have  been  studied 
and  compared  with  other  exponentially  fitted  sche- 
mes [8-10]  in  reference  II  in  relation  with  systems 
of  nonlinear  ODE'S  and  some  of  them  applied  in  ref- 
erence 12  to  the  nuclear  reactor  point  kinetics 
equations.  To  be  applicable,  they  required  that  the 
equations  (if  nonlinear)  be  linearized,  and  that 
mean  values  of  the  parameters  be  taken:  both  approx 
imations  introduce  third  order  local  errors,  and 
the  corresponding  schemes  were  therefore  limited 
to  second  order.  This  is  not  the  case  for  the  new 
schemes  presented  here  which  retain  order  p  +  q 
(or  at  least  p+q-j .  if  j  is  the  number  of  inter- 
polation points  that  remain  far  from  the  origin) 
for  any  problem.  As  far  as  stability  is  concerned, 
all  the  results  previously  obtained  for  the  multi- 
point Pade  approximants  are   valid  here  [8-12]. 

For  a  general  space  S  of  approximants  as  given 
in  CO,  it  is  again  required  that  the  quadrature 
rule  be  exact  for  its  members.  In  the  presence  of 
exponential  basis  functions,  this  problem  is  less 
documented  than  in  the  polynomial  case,  although 
it  is  known  that  Gauss- type  formulae  exist  for 
such  Chebyshev  systems  [13l.  Since  very  few 
realistic  problems  would  require  fitting  of  more 
than  one  exponential  node,  an  attractive  alternative 
may  be  described  as  follows.  First  multiply  both 
members  of  (1)  by  z(t)  to  get 


z  y  =  z  f  (y(t)J, 
or  equivalently 


(22) 


(z  y) 


z  y 


+  z  f(y(t)). 


(23) 


If  (23)  instead  of  (1)  is  integrated  over  [t.,t.+.] 
and  y  replaced  as  above  by  y    in  the  right  nandside 
of  tt»e  resulting  express  ion^,  i  t  is  easy  to  realize 
that  if  z(t)  is  chosen  to  be  proportional  to  exp 
(-At),1  ^0,  the  components  of"  y  behaving  as  exp  (At) 
will  be  integrated  exactly  independently  of  S  (and 
of  the  associated  quadrature  rule). 

Apart  from 

(i)  a  family  of  schemes  for  advancing  the  solu- 
tion of  Eq.  (1)  from  t.  to  t.+.,  a  practical 
implementation  also  requires: 
(ii)a  method  for  estimating  the  local  truncation 
error; 
(iii)a  strategy  for  selecting  the  step  length  h, 
and  possibly  also  the  order  or  particular 
member  of  the  family  to  be  used. 
In  addition,  any  implicit  scheme  involves  the  solu_ 


tion  of  a  system  of  nonlinear  equations  such  as 
(7),  and  the  implementation  also  requires: 

(iv)  an  a  Igor i trmfor  solving  a  system  of  nonlinear 
equations,  while  if  output  is  required  tit 
points  much  more  closely  spaced  than  the 
grid  points  used  for  the  integration: 
(v)   an  interpolation  process  should  ideally  be 
provided.  Some  comparisons  have  been  perfor_ 
med  on  complete  implementations,  without 
separating  the  five  aspects  mentioned  above 
[  1^1  .  However,  in  relation  with  this  work, 
a  modular  structure  is  being  developed  with 
each  aspect  treated  in  a  separate  routine, 
thus  making  it  easy  to  keep  the  elements 
(ii),  (iii),  (iv)  constant,  and  vary  the 
schemes  (i)  used  -(v)  depending  upon  (i).  It 
will  also  facilitate  a  later  study  of  the 
effects  of  varying  (ii),  (iii)  or  (iv). 
Detailed  run  statistics  will  be  reported  later. 
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tbstrnct  : 

is  well  known  (Dahl^uint   1163) 
t  for  the  studv  oc    the  A-stability  of 
linear  Multislep  Methods  for  solvinp  IVPs 
3Es,  A-stable  method  for  order  hipher 
than  two  do  not  exists.   This  paper  pives 
the  necessary  and  sufficient  condition 
for  the  general  multistep  methods  to  be 
A-stable.   It  includes  an  example  for  a 
new  A-stable  -lethod  oc  order  u. 

Introduction  : 


Let  us  recall  the  definition  of  the 
"eneral  Blocking  Over- Impl  ic  it  '-'ultistep 

"1  "ethod,  by  Hassan  Kasr  in  Aolikace 
"athematikv  (under  printing).  The  method 
is  used  ror  solvinp  the  ODE 


jr=f(x,y),  xe[a,b] 
with  the  inital  condition 
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e  ap 
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is 
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ef ined , 
Linsch  it z 
00 ,°°)  so 
ess  of 
nder  the 
nteed 
ce  o* 
hat  in 
proximate 
ously  in 
known  in 

po  ints 
is  a 


positive  integer  am!  h  >  0   is   the 
integration  step,  will  be  called  basic 

points  and  the  points  x..   .=x   +w.h, 

]  k  + 1   ■)  k   ! 

i=l,...,k-l  where  w.  are  any  real 

numbers,  will  be  called  intermediate 
points  while  the  approximate  solution 
the  point  x.,   .  (  i  =  0  ,  .  .  .  ,  k-1  , 


)  will  be  denoted  bv  y(x.    ), 

lk+  i 


1=0,1, 

the  s-d iment ional  vector 


ik"  *  *  'Xik  +  s-l 


)  and  the  s-dimentional 


x  .  =  (  x 

-  1 

vector  of  the  approximate  solution  at 
these  points  bv  VC  x  .  )  =  (v'C  x  . ,  ),..., 

y(x.,     ,  ))   and  analoricall v  the  vector 
"I  k  +  s-1 

z  .  and  v( z . )  by 

-1  -1 


— j    i  k  +  s  ' 


'X(i+l)k+s-l) 


and 


y(0=(y"(xik+s 


),...,r(x(.+1)k+s_1))' 


and  usinp  the  vector- valued  functions 

f(x./v(x.))  and  ^(z.,  v" (  z  .  )  )  at  the 

—1    —1  —1       -     -1 

correspond inp  points.   If  B,D  be  any  kxs 
matrices  and  C  any  k*k  matrix  of 
constants,  then  the  system: 

^(z.  )=B  ^(x.  )+hC  f(z..,y'(z.))+hn  f(x.,f(xJ) 

•1=0,1,.  ..  (3) 

represents  an  alporithm  o f  the  so  called 
HBOM  method. 

We  define  the  local  trunction  error 
of  the  method  by  the  k-d imens iona 1  vector 


L(v(x),h)= 


y ( s+wsh  ) 


y(x+wk-lh) 
y( x+(m+w0)h ) 

y(x  +  (m  +  w   ,  )h  ) 
L/         s  _  i   _1 


-hC 


v ' ( x+w  h  ) 
s 


V  '  (x  +  w,    h  ) 
k-1 

V ' ( x+ (n  +  wQ )h ) 


y  *  (  x+(m  +  w   ,)h) 


v( x+w  h  ) 
o 


y(x+w  _,h) 


■  hn 


y ' ( x  +  w  h  ) 
o 


v ' ( x  +  w   , h  ) 

_      s- 1 


(4) 
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If  the  cirst  k-s  components  of  the  local 
truncation  error  is  of  order  lv  while  the 
remaining  s  component  are  of  order  hP+1 
we  can  rive  a  .weak  definition  o^  the  order 
oc    the  method  as  the  order  p(p  _>  1) 
w.r.t.s  if  and  only  if  the  following  kp+s 
eauations  are  satisfied. 


s 

.  £   h  •  •  =1 
j=l  H 


for  i=l, 


,* 


(5) 


the  necessary  and  sufficient  condition 
for  the  convergence  of  the  solution  of 
the  iterative  system  of  equations  (10) 
i.e. 


|w(z) | 


0(z) 


0i;L(z) 


0  ,  (z) 

si 


■  n   («) 


,0   (z) 
ss 


<  1 


s 

'  "  i   •  -     b  ,•  a 
s-l+i  j=i   il 


k-s     n 
q     r  y  o-l 

'i-l=qCi=l  w 


S-l  +  1 


for  Re  z  <    0 


Example  for  a  GBOM  Method  : 


(11) 


k  i  s 

^  c  .  .  ( m  +  w   ,-,.•)"   +  , 

n     s-k-l+i       1=1  ii   -i-i 


i=k-s+l 

for  i=l,...,k-s,   q=l,...,p-l 


:    d,_.  wq_1] 


(6) 


Takinr  k  =  2,  s=l,  m=l,  w  =0,  w  =£ 

o      1  2 

and  solving  equations  (5),  (6)  and  (7)  as 
9-equations  in  8-unknown  parameters  i.e. 
the  method  is  of  order  4  and  it  can  taVe 
the  form 


( m  +  w        )  -  -   E   b  .  .  w  . 

s-k-l+i      j=i   n   1-1 


y    =v  +  —  ( 5f   +  8f    -f    ) 
n+1   n   24     n      n+1   n+2 


(12) 


k-s 


a-1 


=q[.^   c..w       +         c..(m+w   .  . 

i=x  n  s.1+i  i=k_s+1  it    s-k-i+- 


o-i    y  o  =  y  +  -  (f  +  i-f   +f  „) 

^        Jn+2  Jn   6    n      n+1   n+2 


The  A-Stability  of  the  Method  : 


,q-i 


+.Z   d..  wq-x]  ,  for  i=k-s+l, . . . ,k, 
i=l   il   i_i 


q  =  l,  •  .  •  ,p 


(7) 


A-Stability  Condition  for  a  GBOM  Method  : 

Solving  the  equation  y '  =ciy ,  Rert<  0' 
eauation  (3)  can  take  the  form 

(1-z  C  jy(  z  .  )  =  (  B  +  z  D)y'(x.),   where  z  =  h 

1  1 


(I-zC) 


21+1 


21+2 


v2i+l 
y2i+2 


=  (B+zD)y 


12 


•6z  +  12 


2j 


2  -i 
z 


z^+6z+12 


21 


The  last  equation  is 


(13) 


(14) 


*y(z.)  =  (I-z  C)"1(B+zD)y'(x.)       (8) 
If  0(z)  =  det  (I-zC) 
The  lest  s  equations  of  this  vector  is 


^21+2 


z'+6zt!2 
z2-6z+12 


y2.=»U)y2. 


(15) 


where  w(z)  is  the  complex  variables 
f unct  ion 


ik  +  k 


0(z) 


_yjk+k  +  s-l 


Or  the  form 


Q,,(z).. .0,_(z) 


11 


In  +  l 


'.'(z)  Y 


Is 


0  . (z)  .  .  .n   (z) 
si        ss 


ik 


ik+s-1 
(9) 


do) 


Uaine  the  fixed-point  theorem,  it  is  clear 
that  the  necessary  and  sufficient  condition 
for  the  A-stability  of  GBOM  method  (3)  is 


,  N    z  +6z+12  „      .  n 
r(z)  =  Re  z  <  0 

z2-6z+12 


(16) 


Applyinp  the  maximum  modulous  principle. 
Then  the  modulous  of  this  function  attains 
Its  maximum  value  at  the  boundary 


(i.e.  at  z  =  iy ,  z=«>) 

which  fives  |w(z)|  <  1 

from  (10),  (15)  and  (17)  the  methods  (12) 

which  is  of  order  4  is  A-stable. 


(17) 


.34-2 


e'erences: 

1)  Dahlnuist,    R.    "A    foecial    Stahilitv 
proMe"i    for    linoar    Multisten    Method 

:     T     3     (1063)     27-H3. 

2)  "ear,  T.H.  "Numerical  Initial  Value 
Problems  in  Ordinary  "^inferential 
Equations  "Print  ice- Hall ,  Inc. 
(1071)  212-222. 

3)  Hassan  N'asr.  "Generalized  Blocking 
Over- implic it  Multistep  Method" 
APLIKACE  HATEMATIKY,  Under  Printing 
(1079)  . 

u  )   Hassan  Ilasr.  "Necessary  Conditions 
for  the  Generalized  Dlockin"  Over- 
inplicit  Multistep  Method.  "APLIKACE 
"MATIKY   Under  Printinr  (1079). 


34-3 


Local  error  estimates  for  linear  methods  for  ordinary  differential  equations 

G.J.  Cooper 
University  of  Sussex 


1 .    Introduction 

Most  methods  used  for  the  numerical  solution 

of  ordinary  differential  equations  may  be 

characterized  by  a  pair  of  matrices   (A,B)  . 

Here,  local  error  estimates  for  such  methods  are 

obtained  as  linear  combinations  of  computed 

values.    (This  is  equivalent  to  constructing 

pairs  of  imbedded  methods.)   The  local  error 

estimates  depend  on  the  assumption  that  exact 

solution  values  are  used  to  start  the  step  but  it 

is  shown  that  this  assumption  is  not  required  for 

certain  estimates.    Some  hybrid  methods  have 

such  independent  estimates,  and  also  have  simple 

step  length  change  procedures. 

Consider  an  initial  value  problem 

x'  =  f(x)  ,   x(t0)  =  x0  ,    f  :  Rn  ->  Rn  , 

and  suppose  that  approximations  are  required  on 

[tg,t()+H]  .    Choose  a  step  length  h  so  that 

Mh  =  H  ,  M  a  positive  integer,  and  let 

t   =  tn    +  mh  ,  m  =  1,2,...,M  .   A  linear 
m    v 

s-stage  method  computes,  in  step  m  , 

s   n-element  vectors, 
(m)     s 


yi 


a-.y^-h  I   b..f(y<m))  , 
j=l  lJ  J 


j=l  lJ   J 


,(m) 


for   i  =  1,2,... ,8  ,   where  y>  '  approximates 

x(t   +  c.h)   and  c  =  (ci  ,c-> , .  .  .  ,  c  )    is  a 

m-1   i  ' '  *     '    s 

consistency  vector.    The  method  may  be  expressed 
as 

Y(m)  =  AY(m-l)  +  hB^On))  ,  (1) 

where  Y(m)  =  yi(ln)e  y2(m)  ».  ..•y<n)  and 
F(Y(m>)=  f(yfm))  9   f(y£m))  •  ...«►  f(y(m>)  are 
elements  of   Rns  .   The  linear  operators  A  and 
&     are  defined  by  matrices  A  and  B  ,  the  rows 
of  these  matrices  determining  linear 
combinations  of  the  s   components  of  y^"1-*-' 
and  F(Y(m))  . 


If  step  m  is  started  with  exact  solution 


values 
_(m) 


for 


s  s        (m) 

E  a..x(t   ,+  c.h)  +  h  I  b..f(z)  ')    , 


j-i  u 


m-2 


=  1  !J 


],2,...,s  ,  and  this  may  be  expressed  as 

(2) 


z(m)  _  AX(m-l)  +  hBF(z(m)}  _ 


The  local   error   in  step  m  is   X 


(m) 


Z     but 


the  essential   local   error   is  defined  to  be 

.w/v.(m)    ~(m).    ,         .      .  .    M 

A  (X    -  Z   )   where  w  is  a  given  integer 

For  a  Runge-Kutta 


associated  with  the 

method 

method 

,  0  . 

..  0   1 

1  ° 

0   1 

A  = 

0  ...  0   1 

and  w  =  1   so  that  the  essential  local  error  is 
merely  the  error  in  the  final  stage  (repeated  s 
times).    For  predictor-corrector  pairs  using 
methods  of  equal  order,   w  =  0  ,  but  because 

these  are  multi-step  methods  some  components  of 

v(n)     (m) 

X    -  Z     are  zero. 


The  aim  is  to  determine  a  matrix  Q  so  that, 

(m) 


for  the  corresponding  linear  operator,   QZ 

estimates  Aw(X(m)  -  Z(m))  .    For  a  Runge-Kutta 

method  the  rows  of  Q  will  be  identical.    For 

predictor-corrector  pairs  some  rows  of  Q  will 

(m) 


may  be 
That  is,  some  estimators 


be  zero.   For  some  estimates  (JZ 
replaced  by  QY(m) 

Q  are  independent  of  the  localizing  assumption 

™,(m)      .        »w.„(m)    „(m). 
and  QY     estimates  A  (X    -  Z   )  . 

2.    Conditions  for  error  estimates 

T 
Let  p  =  (pi,  P2,...,p  )   be  a  given  vector 

with  positive  integer  elements  and  define 

hi 

->n 


.  mfx  MPi||y.||  ,    Y   =  viGyo*.  .  .toy 
1 M  l  "    l "  '  s 


where 


is   a  norm  on 


Suppose   a 
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method  is  used  to  compute   Y    ,Y 


,(M) 


(which  depend  on  h   and  hence   M  ).    If  (for 
suitable  starting  values)   ||X    -  Y  m  ||  <   k  , 
m  =  1,2,...,M  ,  for  all  sufficiently  large   M  , 
M  i  M'  ,  then 
l|x(t 


.+  ch)  -  yfm)||  £  KhPi  ,  i  =  1,2 s  , 

m-1    1      i 


and  the  method  is  said  to  be  order  p  convergent. 
A  method  is  order   p   consistent  if  for  M  >,   M' 
*  K  , 


,x(m)  _  z(m). 


M 


,Aw(x(m) 


,(m). 


K 


m  =  1,2 M  , 

llM  *  M  ' 

for  some  integer  w  i  0  .    (A  more  general 

definition  is  possible  [3].)   A  stable  hybrid 

method  is  order  p   convergent  if  it  is  order   p 

consistent.    In  the  present  general  context, 

a  hybrid  method  is  defined  by 
,  max 


Ae. 

i 


=  0  ,   i  =  1,2,. . . ,s  , 
is  the  natural  basis  for 
of   A  ,  this 


where  ej ,  e2, . . . ,e 

s  .  . 

R   .    Since   Ae.   is  column   l 

l 

means  that  stage  i   in  step  m-1   is  used  in 
step  m  only   if  this  stage  is  of  maximum  order. 

To  estimate  the  essential  local  error,  a 
matrix  Q  is  required  so  that  (for  M  >,   M') 


|Aw(x(m)_  ZM)    _   QZ(m), 


M 


£1 

M2 


1,2,. 


,M 


Since   AW(X(m)-  Z(n°)-  QZ(m) 


m        .w>  ,,,(m)   „  (m)  s    ,.„(m)      ...  . 

(Q  +  A  )(X   -  Z    )  -  QXV    ,  sufficient 

conditions  for  Q   to  be  an  estimator  may  be 

obtained  directly  from  the  algebraic  conditions 

for  order  p  consistency  [2].   Let 

T 
e  =  (1,1,..., 1)    and  let   C  be  the  diagonal 

matrix  whose  diagonal  elements  are  the 

elements  of  the  consistency  vector. 


Theorem  1 


Suppose  a  stable  hybrid   (A,B) 


method  is  order  p   consistent  for  a  given 

•  ^i_       ^  min 
order  vector  p  with  p.  <  2   .   p.  , 

J   J_ 

i  =  1,2,. ..,8  .   Then  Q   is  an  estimator  if, 

'for  each   i  =  1,2,. ..,s  , 

eT  QCTe  =0,   t  =0,1,..., p.  +1, 

t    w   T,r1    T<-1 

e!  fO+Aw)  BC  "   . 


BC 


[CT°  -  A(C-I)T° 


Bc/^e 


Tq+Tj  ♦...+  t  £    p.  +  1  ,  where   u  =  0,1,2, 


and   t  r; ,  t  ,  ,  .  .  . 


,t    take  all  possible  positive 
U 


3.    Independent  estimates 

An  estimate  is  independent  of  the 

localizing  assumption  if 
k' 


||Q(Z(m)_  yW)! 

Let   U(m)  =  Z(m)-  Y(m) 


,  m  =  1,2, 


,M  . 


Then  (1)  and  (2) 


give 
U 


(m) 


^A^U 


(0)  ♦  mil   LlCL<*+>-   Z^)  + 

i=l  ~ 


h  \      AiBrF(Zm  i} 
i=0 


F(Y(m_i))] 


Let  m  >  v  ,  a  positive  integer,  and  assume 
QA  =0  so  that  the  expression  for  QU 
simplifies.    The  function  differences  may  be 
replaced  by  a  derivative  expression  and  (1) 
applied  again.   With  further  assumptions  it 
becomes  possible  to  give  conditions  for 
independence.   The  following  result  is  a 
particular  case. 

Theorem  2     Suppose  the  conditions  of 
theorem  1  hold.    Suppose  that   QA  =  0   and 
QBA  =  0  ,  v  a  positive  integer.   Let  p.  =  p 

i  =  1,2, ... ,s  ,  and 

T    x  -1       Tj-1 
QBA   BC  v      . . .  BC 


r  L  n 

[C  °-  A(C-I) 


where   t  =  1,2,3,..., 


t0  BC     ]e 


0,1,2,. 


0  , 


and 


T0»T1» 


take  all  positive  integer  values. 


Then  (for  suitable  starting  values) 

IQ  (Z(m)-  Y(m))||   S  4  .  m  *   v  .  M  5  M'  . 

ii   M 

Such  an  estimator  is  called  a  Q(v) 
estimator.   The  condition  m  >,   v   is  crucial 
because  the  estimate  is  independent  of  the 
localizing  assumption  only   after  v  steps 
have  been  completed  (and  this  applies  to  each 
step  length  change).   The  new  starting 
conditions  have  to  be  chosen  to  maintain  order 
p  convergence. 

Consider  the  Adams-Bashf orth  and  Adams- 
Moulton  methods  of  order  k  used  in  the  PECE 
mode.   A  Q(k)   estimator  may  be  obtained  and 
this  is  just  the  Milne  estimate.   An  extension 
of  theorem  1  provides  estimates  for  Runge-Kutta 
methods  of  order  four  (including  the  classical 
fourth  order  method).   These  estimates  are 
obtained  by  imbedding,  using  three  extra  stages 
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(two  extra  function  evaluations),  but  may  not 
be  independent. 

The  following  method,  represented  as 
p|A|B|Q|c,isa  hybrid  method  with  a 
Q(l)  estimator  (and  w  »  1).   The  first  three 
stages  give  a  hybrid  method  requiring  two 
evaluations  of   f   in  each  step.    Since  the 
second  stage  is  of  order  three,  and  is  not 

3 

3 


0 

0 

1 

0 

0 

I 

0 

3 

0 

0 

u 

4 

1 

8 

0 

0 

0 

"7 

7 

5 

16 

0 

0 

0 

21 

21 

1 

8 

0 

0 

0 

T 

7 

used  in  the  next  step,  it  may  be  used  to  halve 
the  step  length.    The  final  two  stages  are 
needed  only  for  error  estimation  and  one 
function  evaluation  is  required  in  the  next 
step;   overall  one  extra  evaluation  of   f   is 
required.    A  similar  method  of  order  four  is 
known. 


3 
4 

0 

0 

0 

2 
7 

8 
7 

0 

0 

452 
567 

184 
567 

8 

81 

0 

8 
~7 

16 

7 

3 
7 

27 

7 

0    0 
0 

0 


-1 

3 
~4 


_16 

21 


16 
21 


The   method   is   also  given  by    the   equations 
(m)  (m-1) 

yi       -  Y3  . 

(m)        1      (m-1)      3      (m-1)      3  ...    (m), 

y\      =  i  yi       +  i  y\       *  ■%  hf(y}  ')  , 

1      (m-1)      8     (m-1)      2  ...    (m),    x   8  ...    (m) , 

y3     s-,yi       +  -  y3      -  ■?  h* (yi    )  +  -  hf (?2    )  , 


21 

1 

7 


7  "■  7  J3  7     * w*     '        7 

5       (m-1)      16      (m-1)      452  ...    (m),        184  ...    (m),    J      8    ...    (m), 

y*     -  tt  y\      +rry3      +TA7hf(yi    )-^rhf(y2    )+-5rhf(y3   >, 


567 


81 


567 

1   (m-1)   8   (m-1)   8  ...    (m),    16  ...    (m),   3  ...    (m) ,   27  U£/  (m) , 
'4  y  y§   '-  -  hf(yj   )  -  —  hf(y2   )  +  j   hf(y^  ')  +  —  hf(y^  y) 


Since   21    =  x(t    )   each  component  of  the  essential  local  error  is  proportional  to  x(t  )  -  Z3 
m— 1  m 

and  the  estimator  gives  just  one  estimate 

„/,.  s  _   (m)     „(m) 
x(tm)    z3    =  y5 


yim)  +  0(h5) 
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IN  SEARCH  OF  A  ROBUST  INTEGRATION  ALGORITHM 

FOR  GENERAL  LIBRARY  USE : 

SOME  TESTS,  RESULTS  AND  RECOMMENDATIONS 

M.B.  Carver 

Atomic  Energy  of  Canada  Limited 

Chalk  River  Nuclear  Laboratories 

Chalk  River,  Ontario  KOJ  1J0 


ABSTRACT 

The  design  or  selection  of  an  integration  algorithm 
suitable  for  general  use  in  a  subroutine  library 
or  within  a  simulation  package  is  a  demanding 
process,  particularly  as  it  is  unrealistic  to 
support  a  number  of  routines  which  purport  to 
perform  the  same  task.   In  addition  to  satisfying 
the  criteria  of  excellence  of  documentation, 
reliability  and  efficiency  which  a  numerical 
analyst  would  require  of  an  integrator  to  be  used 
in  a  specific  project,  a  routine  destined  for 
general  use  must  be  exceptionally  robust,  as  the 
average  user  has  neither  the  desire,  nor  the 
specialized  knowledge  to  interact  effectively  with 
the  algorithm. 

This  paper  examines  the  degree  of  robustness  of 
selected  integration  algorithms  by  exposing  them 
to  a  test  profile  of  diverse  problems.   A  number 
of  common  areas  of  difficulty  are  identified,  and 
methods  used  to  promote  self  sufficiency  in  these 
areas  in  the  author's  sparse  matrix  integration 
package  GEARZ,  are  discussed. 

SUMMARY 

An  extremely  large  number  of  algorithms  have  been 
proposed  for  the  numerical  solution  of  the  ordinary 
differential  equation  initial  value  problem.   The 
design  or  selection  of  such  algorithms  for  use  in 
a  library  of  mathematical  subroutines  for  general 
scientific  and  engineering  use  is  difficult,  as  it 
is  impractical  to  maintain  a  large  number  of 
subroutines  which  attempt  to  do  the  same  job  with 
varying  degrees  of  success. 

Criteria  for  selection  must  include  accuracy, 
efficiency  and  ease  of  use,  thus  one  can  consider 
realistically  only  those  algorithms  which  have 
options  to  determine  an  optimal  integration  step 
size,  and  possibly  order,  by  means  of  a  built-in 
estimate  of  the  associated  truncation  error,  and 
are  presented  as  quality  software,  complete  with 
detailed  internal  and  external  documentation  which 
emphasizes  the  weaknesses  as  well  as  the  strengths 
of  the  method.   These  restrictions  considerably 

ice  the  possibilities,  but  a  number  of  candi- 
dates remain.   Typical  amongst  these  are  algo- 
rithm       i ,od  by  Gear  [  1  ]  ,  Hindmarsh  [2]  ,  Byrne[3], 
ne(4)  and  Krogh[5J,  all  of  which  satisfy  the 


Because  these  algorithms  are  reliable,  quality 
software,  numerical  analysts  have  incorporated 
them  in  a  multitude  of  applications  packages,  in 
which  the  user  is  shielded  from  the  complexities 
involved  in  management  of  the  integration.   This 
type  of  application  has  in  fact  been  their  greatest 
success .   To  the  uninitiated  user  of  a  subroutine 
library,  however,  their  correct  implementation  can 
be  seen  as  a  prohibitive  task. 

In  order  to  focus  further  on  quality  for  general 
use,  the  criterion  of  robustness  must  be  added,  a 
robust  algorithm  being  defined  as  one  which  pro- 
duces the  result  to  the  desired  accuracy  or  a 
clear  indication  of  failure,  requires  a  minimum  of 
effort  and  does  not  demand  clairvoyance  from  the 
user.   This  criterion  is  essential  because  the 
majority  of  projected  users  are  not  numerical 
differential  equation  experts,  and  therefore,  not 
qualified  to  make  decisions  on  fundamental  issues 
such  as  the  initial  step  size,  error  base,  spar- 
sity,  and  error  recovery.   Unfortunately,  as  such 
decisions  are  frequently  left  to  the  user's  dis- 
cretion in  the  guise  of  generality,  the  applica- 
tion of  this  robustness  criterion  eliminates  most 
of  the  above-mentioned  candidate  algorithms. 

This  paper  attempts  to  show  that  such  algorithms 
can  be  made  considerably  more  robust  at  the  ex- 
pense of  a  negligible  loss  of  generality  by  fur- 
ther automation  and  simplification  of  the  decision 
process  during  start  up,  integration,  error  pro- 
cessing, and  discontinuity  handling.   Several 
specific  areas  where  traditionally  required  user 
interaction  can  be  either  eliminated  or  reduced  to 
optional  status  to  improve  effectiveness  are 
discussed.   They  are: 

(a)  Initial  Step  Size  Selection:  A  routine  with 
efficient  step  size  adjustment  is  far  better 
qualified  than  a  user  to  provide  this  selec- 
tion. 

(b)  Tolerable  Error  Imposition:   An  algorithm 
normally  accepts  a  step  if  the  estimated 
local  truncation  error  in  Y.  satisfies 

€ .<TOL*Ybase  . ,  where  Ybase.  is  frequently 
defined  as  tne  maximum  attained  value  of  Y., 
or  made  available  as  an  array  for  user  spec- 
ification.  Again  users  have  no  criteria  on 

which  to  nominate  each  Ybase  .  ,  but  automatic 

J 
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iii    is   gen- 
to  pi  >v  Lde 
-    •  '    |y|    is  ef- 

if    this   differs    from   the 

• 

iluation  and  Handling:   Most 
irators  require  estimates  of  the 
in  matrix  from  time  to  time,  and  the 
often  left  to  speculate  on  a  choice 
;ow  these  should  be  obtained.   Sophisti- 

fficiency  by  codinq  their  own 
.bian,  but  within  a  general  package  this 
na   more  remote,  and  could  end 
it  than  numerical  evaluation. 
■  .1  nunerical  procedure  is  to  perturb 

Y   to  obtain  the  set  3F./3Y-, 
7  n  function  evaluation  calls.   This 
•   -zed;  for  example,  a  triJiaqonal 
matrix         may  be  evaluated  using  only 
three  calls,  a:.d  proportionate  savings  may  be 

lized  in  .sparse  matrices  [6]  .  Sparse 
matrix  Jacobian  evaluation  and  handling 
optimized  in  this  way  is  invaluable  for 

eral  PDE/ODE  packages  in  which  structure 
is  arbitrary. 

ontimiit'       -ion  and  Negotiation:   A 
ntion  definition  occuring  at  an 
unknown  tae  dependent  on  critical  rela- 
r.ships  c  .teqration  is  a 

,-al  phenomenon,  but  is  traumatic 
•  an  integrator  and  frequently  leads  to 
••  limit  or  termination.   A  robust  inte- 
jr  should  detect  such  a  discontinuity  and 
transcend  it,  issuing  a  suitable  terse  report. 

• .onal  user  interaction  to  facilitate 
smoother  handling  of  discontinuities  should 
be  made  available,  for  example  see  reference 
[7J. 

(e)  Stiffness  Detection:   The  most  common  way  a 
standard  non-stiff  integration  algorithm 
detects  stiffness  is  to  run  to  time  limit. 
This  is  unsatisfactory  as  the  results  pro- 
duced will  be  of  little  value.   A  far  less 
expensive  approach  is  to  report  suspected 
stiffness ;4]  and  either  refuse  to  waste 

e  computing,  or  if  possible, 
switch  to  a  stiff  algorithm. 

(f)  Calling  Sequence  and  Common  Blocks:   In  a 
package,  the  user  is  shielded  from  direct 
communication  through  the  integrator  param- 
eter list,  but  this  is  mentioned  here  to 
facilitate  library  or  package  interface 
design.   Any  integrator  calling  sequence 
requires  ten  parameters,  the  derivative 
subroutine,  dependent  variable  and  derivative 
vectors  and  their  size,  independent  variable 
and  interval,  tolerance,  warning  flag,  one 
working  storage  array  and  current  step  size. 

these  parameters  are  mandatory  and  all 
other  communication  may  be  through  optional 
common  blocks.   The  integrator  itself  can 
assign  parts  of  the  working  storage  array  in 
predictable  sequence,  and  choose  a  suitable 
method  for  handling  the  Jacobian  from  an 
assessment  of  the  equation  structure  and 
available  working  storage. 


A  number  of  results  of  both  academii  and  ap] 
tions  problems  are  given  to  illustrate  ti 
modifications  proposed  above  are  not  only  more 
robust,  but  frequently  more  efficient. 

Finally,  a  brief  description  is  given  of  the  Gl 
package,  which  incorporates  the  modifications 
discussed  into  a  combination  of  the  Hindmarsh- 
algorithms [2]  with  the  Curtis-Reid  sparse  matrix 
routines[6],  and  is  currently  used  on  the  Atomic 
Energy  of  Canada  Limited  subroutine  library  and 
the  F0RSIM(8],  and  MACKSIM[9]  simulation  pack, 
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ABSTRACT 


Let  us  consider  a  system  of  ordinary 
differential  equations  of  the  form 

■  ,  v  ' v    )  =  0  where  y  and  f  are  vector 

functions  of  x.   By  introducing  an  operator  T  such 
that  Tu  =  f(x,  u,  u',...,  u   )  we  have  Ty  =  0. 
Assuming  that  y   is  an  approximation  of  the 
solution  y(x)  a  generalisation  of  Newton's  method 
can  be  applied  to  improve,  under  certain 
conditions,  such  approximation  by  the  recursive 
algorithm  yi+1  =  y     -  I*(y  )   "Ty1  (i  =  0,  1, 
-,...)  and  assuming  that  T'(y  )    exists. 


In  the  present  case  we  can  use  such  an 
approach  in  a  numerical  fashion  as  follows.   After 
obtaining  by  any  method  of  integration  numerical 
approximations  y  on  a  discrete  set  of  points 
x   (n  =  1, .  .  .  ,  N)  we  interpolate  them  by  a 
convenient  function  P(x).   By  taking  this 
interpolant  as  the  first  analytical  approximation 
y  Newton's  process  is  applied  pointwise  in  order 
to  correct  by  iterations  the  discrete  approximations 
y  .   This  procedure  may  become  rapidly  convergent 
especially  in  some  stiff  problems  where  we  have 
obtained  so  far  promising  results. 
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EXTENDED  ABSTRACT 

A  portable  compiler  computer  program  is  de- 
veloped for  the  automatic  numerical  solutions  of 
ordinary  differential  equations.  The  inputs  to 
this  compiler  are  FORTRAN  statements  of  the  equa- 
tions (differential  and  algebraic)  and  of  the 
initial  conditions.  The  output  of  this  compiler 
is  an  object  program  written  in  FORTRAN.  The 
Automatic  Taylor  Series  (ATS)  method  of  solving 
ordinary  differential  equations  is  based  on  the 
premise  that  differentiation  of  functions  can  be 
performed  at  specified  points  by  simple  additions 
and  multiplications.  The  solution  of  very  com- 
plicated ordinary  differential  equation  can  be 
expanded  into  a  long  Taylor  series  with  ease. 
The  availability  of  very  long  Taylor  series  has 
led  to  new  understanding  of  the  convergence  pro- 
perty of  the  series  in  relation  to  the  locations 
of  singularities  and  to  the  radius  of  convergence. 
It  is  then  possible  to  set  an  error  limit  before- 
hand and  maintain  local  error  control  over  the 
computations  within  that  limit  up  to  almost 
machine  accuracy.  The  power  of  the  Automatic 
Taylor  Series  (ATS)  method  lies  in  the  speed  and 
accuracy  that  is  obtainable. 

The  foundation  of  the  Automatic  Taylor 
Series  (ATS)  method  is  the  replacement  of  diff- 
erentiation of  functions  by  simple  arithmetic 
operations.  The  first  publication  on  this 
simplification  was  by  Wilsonfl]  in  1949. 
Gibbons[2]  obtained  the  recursive  relations  for 
the  fully  automatic  differentiation  of  some 
simple  fucntions  in  1960.  A  summary  of  power 
series  method  was  published  by  Leavitt[3]  in 
1966.  Application  of  power  series  to  the  auto- 
matic solutions  of  ordinary  differential  equa- 
tions was  first  discussed  by  Moore[4]  in  1966. 
Barton,  et  al [ 5]  in  1970  first  reported  imple- 
mentation of  an  automatic  solution.  That  auto- 
matic solution,  by  Barton,  et  al ,  was  obtained 
through  a  compiler  program  that  was  written  in 
a  machine-dependent  language.  One  of  the  pre- 
sent authors  published  an  account  of  a  compiler 
program  written  in  FORTRAN  for  the  automatic 
solutions  of  ordinary  differential  equations[6] 


*  This  work  is  in  part  supported  by  N.S.F.  Grant 

'  j 78-03022. 

♦  Present  address:  Department  of  Computer  Science, 
Iowa  State  University,  Ames,  Iowa  50011 


in  1974.  The  principal  differences  between  this 
and  the  one  by  Barton,  et  al  are  the  calculation 
for  the  series'  radius  of  convergence  and  the 
concomtitant  error  control.  This  compiler  was 
written  in  FORTRAN  with  hope  that  it  would  be 
portable.  This  proved  to  be  optimistic,  not 
factual.  There  are,  at  present,  two  versions  of 
this  compiler:  one  for  the  CDC-computer,  and  one 
for  the  IBM-computer.  These  two  versions  are 
quite  distinct. 

The  present  paper  is  the  report  of  a  por- 
table compiler  program  for  the  automatic  solu- 
tions of  ordinary  differential  equations.  This 
program  is  written  in  FORTRAN  with  special  treat- 
ment of  machine-dependent  parameters  to  obtain 
portability.  Parameters  such  as  bits/character, 
character/word,  single-precision  limit,  and 
double-precision  limit  are  stored  in  a  single 
Blockdata  location  for  easy  access  and  quick 
change.  Since  the  bits/character  and  word  length 
are  machine-dependent,  we  have  also  Instituted 
a  numerical  code  for  character  strings.  This 
allows  for  easy  character  string  manipulations 
us  inn  integer  arithmetic;  thus,  we  have  overcome 
the  difficulty  of  character  string  manipulation 
on  certain  computers.  As  development  proceeds, 
testing  is  being  done  on  computers  from  several 
manufacturers:  IBM,  CDC,  PDP,  and  Xerox  Sigma. 

The  input  to  this  compiler  program  consists 
of  four  blocks  of  user  specified  information: 
the  statement  of  differential  equations  and  three 
blocks  of  instructions  to  be  inserted  into  the 
object  program.  Each  of  the  data  blocks  ends 
with  a  data  terminator  ($),  and  only  one  data 
terminator  may  appear  on  a  line.  The  differential 
equations  are  entered  in  the  first  block,  with 
(DIFF)  as  the  differential  operator.  For  example, 
DIFF(Y,X,2)  denotes  y"(x).  The  rest  of  the  ex- 
pressions are  entered  in  typical  FORTRAN  state- 
ments. 

For  example,  to  solve  the  problem: 


y"  =  y'  sin(y)  exp  (x); 
y(0)  =  1,  y'(0)  =  -1  on  [0,4] 
the  input  would  be 
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Column  7 

DIFF(Y,X,2)  ■  DV*SIN(Y)*EXP(X) 
DY  ■  DIFF  (Y.X.I)   $ 

START  =0.0 
4.0 
1.0 
-  1.0   S 


END 

Y  (1) 

Y  (2) 
S 


This  is  an  example  of  the  simplest  possible  input. 
The  use  of  the  first  and  third  data  blocks  is  man- 
datory, while  the  use  of  the  second,  and  fourth 
data  blocks  is  optional.  The  sophisticated 
user  may  control  the  solution  of  very  complicated 
problems  through  these  optional  data  blocks. 

The  second  data  block  may  be  used  to  enter 
non-executable  FORTRAN  statements,  such  as  COMMON, 
DATA,  or  SUBROUTINE,  into  the  object  program  at 
the  appropriate  location.  Using  the  SUBROUTINE 
statement  allows  the  user  to  interlace  many  com- 
plicated programs  together.  The  third  data  block 
is  used  to  define  the  solution  interval  and  initial 
conditions.  The  fourth  data  block  may  be 
used  for  such  things  as  changing  the  amount  of 
printout,  starting  and  stopping  the  solution  at 
any  given  point  in  either  the  dependent  or  inde- 
pendent variables,  and  the  calling  of  other  sub- 
routines. Problems  with  piecewise  analytic  solu- 
tions can  be  solved  with  ease. 

This  compiler  program  reads  the  input  and 
generates  a  FORTRAN  object  program,  which  sub- 
sequently is  compiled  and  run  to  solve  the  speci- 
fied problem.  The  object  FORTRAN  program  gen- 
erated by  this  compiler  conforms  as  closely  as 
is  possible  to  the  general  structure  suggested  by 
Hull  and  Enright[7].  As  the  first  data  block  of 
equations  are  read  in,  this  compiler  produces  a 
list  of  tokens.  Each  token  represents  a  con- 
stituent in  the  equations.  Variables  and  con- 
stants are  stored  in  arrays  as  integers  with  the 
corresponding  tokens  pointing  to  the  proper 
element  in  the  arrays.  The  token  produced  for 
a  function  is  an  integer  code  representing  that 
function. 

The  list  of  tokens  are  checked  for  gramma- 
tical correctness  and  put  in  Polish  postfix  form. 
The  grammar  used  is  an  LL(1)  grammar,  which  has 
the  property  that  the  top  stack  symbol  and  the 
input  token  uniquely  determine  which  production 
of  the  grammar  should  be  applied  next.  The 
postfix  expression  is  then  arranged  in  triplets 
of  operator-operand-operand,  from  which  the 
final  object  program  is  generated. 

This  compiler  is  a  significant  improvement 
over  the  earlier  compilers  for  the  automatic 
solution  of  ordinary  differential  equations. 
Not  only  is  it  portable  between  different  com- 
puters, by  the  simple  change  of  an  easily  access- 
ible blockdata,  there  are  other  important  features. 
The  calculation  for  the  series'  radius  of  con- 
vergence has  been  improved  such  that  the  order  of 
the  singularity  in  the  complex  plane  of  the  solu- 
tion function  can  be  estimated  with  a  certain 
degree  of  accuracy.  We  have  also  included  in  this 
compiler  the  ability  to  vary  the  ratio  of  step- 
size  to  the  radius  of  convergence  and  variable 
order  in  the  numerical  integration.  Other 


improvement  include  the  choice  of  single-  or 
double-precision,  and  comprehensive  diagnostic 
error  messages . 

Copies  of  this  compiler  will  be  made  avail- 
able to  researchers  in  tlie  field  of  numerical 
ordinary  differential  equations  upon  reouest, 
when  all  of  the  major  bugs  have  been  removed  and 
full  documentation  is  ready. 
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ROSENBROCK-TYPE  METHODS  FOR  THE  NUMERICAL  SOLUTION  OF  STIFF  SYSTEMS 

Peter  Kaps 
University  of  Innsbruck 


Abstract:  Generalized  Runge-Kutta  methods  for  the  numerical  integration  of  stiff  systems 
are  investigated.  A  simple  way  of  obtaining  the  order  conditions  is  given.  Numerical 
results  of  two  embedded  (3)4  methods  (GRK4A,  A-stable,  and  GRK4T,  A(89°) -stable  with 
small  truncation  errors)  are  presented.  Comparing  computing  time  both  methods  are 
competitive  with  the  GEAR  program. 

In  this  paper  I  want  to  give  a  survey  of  the  results  of  my  thesis  [ 1 ]  and  two  articles 
I  wrote  together  with  P.  Rentrop  [2]  and  G.  Wanner  [3]. 

For  the  numerical  solution  of  the  initial  value  problem 

y' (x)  =  f(y(x)),  y(xQ)  =  yQ 
defined  in  an  n-dimensional  real  or  complex  space  the  following  method  is  considered: 
E  =  I-Yhf ' (yQ) 
Ek1  =  hf(yQ) 
Ek2  =  hf(yQ+a21k1)  +  hf(y0)Y21kl 


Ek 


s  =  hf'y0+  giV---+as,s-iks-i)  +  hf,<y0)(YsiV--  -Vs-iVi } 


s 
y.  =  y   +   T   c  .  k  .  . 

1  1      1Q  .  L  .  11 

1=1 

Here  y,  a..,  y .  .  ,  c.  are  real  coefficients,  h  denotes  the  step  size,  f'(y  )  the  Jacobian, 

I  the  identity  matrix  and  s  the  number  of  stages.  The  method  needs  one  evaluation  of  f ' , 
the  solution  of  one  linear  system  for  s  right-hand  sides  and  up  to  s  evaluations  of  f. 
The  matrix-vector  multiplications  can  be  avoided  by  a  suitable  transformation. 

Similar  methods  were  first  investigated  by  H.H.  Rosenbrock  [4],  G.  Wanner  intoduced  the 

coefficients  y.   ..  So  we  call  these  methods  Rosenbrock-Wanner  or  short  ROW  methods. 
ID 

The  equations  of  condition  have  a  structure  similar  to  Runge-Kutta  methods.  In  [1],  [3] 
they  are  derived  with  the  help  of  Butcher  series.  In  order  to  illustrate  how  easy  they 
can  be  obtained,  we  give  as  an  example  the  equation  for  the  tree 
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1 
The  tree  t  has  7  nodes  and  corresponds  to  an  elementary  differential  of  order  7.  Every- 
body who  has  ever  tried  to  obtain  the  equations  of  condition  by  Taylor  series  expansion 
will  appreciate  the  following  simple  procedure:  Attach  to  each  node  of  t  a  summation 
index  i, j ,k. . .  .  Put  0  !  .  :  =  a . .+y .   . +y6 . . .  Then  we  must  have 


.3.2  c  .a  .   .  0'. ,  a,  ,  a.  a  .  0  '   =  1  . 

* — rn — i 1    i]  ]k  kl  km  in  no 

l, j,k,l,m,n,o      J 

The  product  of  integers  before  the  sum  is  Butcher's  coefficient  r(t),  but  let  us  call  it 

T(t)  here  in  order  to  avoid  any  confusion  with  the  coefficient  y   of  the  ROW  method.  Then 

there  is  a  sum  over  all  summation  indices  attached  to  the  nodes  of  t.  The  summands  are 

products  containing  the  factors  c.  for  the  root  of  the  tree,  a    (or  0'  )  whenever  a 

l  pq      pq 

multiply  (or  singly)  branched  node  p  is  connected  directly  upwards  with  a  node  q. 
The  matrix  (a.  .)  has  elements  *  0  below  the  diagonal  only,  but  (0!  .)  has  non-zero 


diagonal  elements.  Putting  0.  .  =  0!  . -y6.  •  one  obtains 

'   l]     lj    ID 

1  =  \\    •  fc2  =N^I    and  t3  =  \^ 


where  t,  = 


are  the  trees  which  result  from  t  by  omitting  1  or  2  singly  branched  nodes.  By  this 
procedure  the  number  of  summands  is  considerably  reduced.  The  equations  of  condition 
obtained  in  this  way  are  listed  in  [1],  [2],  [3]  up  to  order  5. 


ROW  methods  have  good  stability  properties.  The  stability  function  is  a  rational 

s 
approximation  to  the  exponential  function  with  denominator  (1-yz)  .  If  the  numbei 

stages  does  not  exceed  the  order  the  numerator  P  is  given  by 

(-T)1 
(k-1)!  • 


s   ,   k      ,   ,i 

P  =  I      z"  I       (?)  *=**- 

k=0    i=0 


k  1 

The  coefficient  of  z   is  a  generalized  Laguerre  polynomial  in  —  . 

Unfortunately,  one  can  not  obtain  r-values  leading  not  only  to  small  truncation  errors 
but  also  to  A-stability.  Therefore,  we  worked  out  A (a) -stable  methods  with  small 
truncation  errors  and  A- (or  L-) stable  methods. 

Reasonable  integration  methods  need  also  a  step  size  control.  We  investigated  in  [1], 

[3]  methods  using  passive  Richardson  extrapolation  (that  is  two  steps  with  step  size  h 

and  one  with  2h  for  error  estimation) .  The  following  table  gives  the  minimal  number  of 

stages  s   necessary  for  a  method  of  order  p: 
m  r 


p 

4 

5 

6 

s 

m 

3 

5 

6 
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In  [2]    embedded  methods  of  orders  (3)4  were  studied.  The  solution  of  order  4  was  used  for 
step  continuation.  Four  stages  were  necessary,  but  only  3  function  evaluations  because 


'31 


a41,  a32 


a42'  a43 


0 


Both  types  of  methods  work  well.  As  an  example  the  embedded  methods  GRK4A  (A-stable) 

and  GRK4T  (small  truncation  error,  A(89.3  ) -stable)  are  compared  with  GEAR  for  the  25 

test  examples  of  Enright,  Hull,  Lindberg  [5].  The  Jacobian  was  calculated  by  difference 

approximation  which  needs  n  function  evaluations.  For  step  size  control  the  formula 

(TOL/EST)  1   ,  0.5  h  ,,  <  h     <1.5h,.  was  used.  EST  =l|  y,  ,--y.  ,,l| 

-Mold  -Mold 


h     =  0.9  h  n, 
new        old 


0.5  h  n ,  <  h     <1 .5  h  .  , 

old  —   new  —      old 


is  the  estimate  (0   and  y.  denote  the  solutionsof  orders  3  and  4  respectively  and  II    II 

the  maximum  norm)  and  TOL  is  the  given  tolerance.  The  safety  factor  0.9  and  the 

limitation  of  h     are  fixed  by  experience.  A  new  step  is  accepted  if  the  error  does  not 
new  J         c  c  c 

exceed  the  tolerance.  For  components  smaller  than  1  the  absolute  error  was  used,  and  for 

the  others  the  relative  error.  In  the  following  table  the  computing  time  TZ  in  seconds 

-4 
and  the  total  number  of  function  evaluations  TF  are  listed  for  TOL  =  10    and  initial 

-3 
step  size  10 


All  examples 

Allexamples  except  B5, 

E4 

TZ     TF 

TZ      TF 

GRK4A 

18.85 

14428 

16.02 

12431 

GRK4T 

19.23 

14181 

14.21 

11043 

GEAR 

44.21 

9099 

21  .57 

5423 

(The  computationswere  performed  in  FORTRAN  single  precision  with  a  38  bit  mantissa 
(11  decimals)  on  the  TR440  of  the  Leibniz-Rechenzentrum  of  the  Bavarian  Academy  of 
Science).  GEAR  has  difficulties  in  B5,  where  computing  time  reaches  21.23  seconds.  In  E4 , 
GRK4T  is  approximately  4  times  worse  than  GRK4A.  This  is  due  to  a  positive  eigenvalue  in 
E4  and  disappears  for  an  exact  evaluation  of  the  Jacobian. 
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We  consider  an  initial  value  problem  for  a 
stiff  system  of  first  order  differential  equations 
which  occurs  in  the  modeling  of  flows  of  ions  in 
the  upper  ionosphere.  The  problem  is  that  we 
know  the  values  for  some,  but  not  all,  of  the 
initial  conditions.  The  dependent  variables  for 
which  we  are  solving  are  electron  temperature, 
heat  flow,  and  stress.  In  this  context  stress 
means  temperature  anisotropy,  i.e.  the  difference 
between  temperature  parallel  and  perpendicular 
to  the  earth's  magnetic  field.  Realistic 
initial  values  for  temperature  and  heat  flow  are 
known  from  satellite  measurements.  The  stress 
value  is  not  known  because  satellite  measurements 
have  generally  assumed  an  isotropic  temperature 
distribution.  We  shall  see  that  we  can  exploit 
the  stiffness  of  the  system  to  calculate  a 
realistic  stress  value. 

The  choice  of  initial  values  is  critical  to 
the  nature  of  the  solution,  particularly  in  the 
early  stages.  Most  choices  of  initial  conditions 
give  rise  to  solutions  which  have  a  rapid,  short- 
term  fluctuation  which  soon  dies  out.  This 
behavior  is  typical  of  stiff  systems,  but  not  of 
the  physical  system  under  consideration.  For  a 
typical  (arbitrary)  choice  of  initial  stresses, 
temperature  fluctuations  on  the  order  of  100  K 
in  one  kilometer  are  observed  in  the  early  stages 
of  the  numerical  solution.   (Note:  the  physical 
problem  is  an  initial  value  problem  in  the  mathe- 
matical sense  only.  The  independent  variable  is 
altitude,  not  time.  We  start  at  a  convenient 
initial  altitude  and  march  outward  from  the  earth 
a  distance  of  some  ten  thousand  kilometers.) 
Satellite  measurements  indicate  that  such  a 
fluctuation  is  physically  unrealistic.  Tempera- 
ture changes  of  this  magnitude  actually  take 
place  over  tens  or  hundreds  of  kilometers.  The 
short  term  fluctuations  in  the  numerical  solution 
are  caused  by  the  wrong  choice  of  initial  stress. 
Therefore,  we  wish  to  choose  our  unknown  initial 
stress  in  such  a  way  as  to  obtain  a  solution  in 
which  rapid,  short-term  fluctuations  are  mini- 
mized or  eliminated.  Such  a  solution  is  said  to 
be  in  stiff  equil ibrium.  A  general  procedure  for 
determining  initial  conditions  for  which  the 
solution  is  in  stiff  equilibrium  is  developed  and 
justified  here.  We  consider  the  linear  problem 
first  and  show  how  the  unknown  initial  values  can 
be  calculated  by  a  direct  procedure.  Then  we 
show  how  the  non- linear  problem  can  be  solved  by 
a  procedure  which  iterates  the  direct  procedure 
of  the  linear  problem.  Finally,  we  give  numerical 


results  showing  that  the  procedure  does  work  for 
the  problem  under  consideration. 

MATHEMATICAL  DISCUSSION 
We  consider  first  the  linear  problem 
y'  =  Ay  +  b.  We  assume  that  the  system  is  stiff, 
i.e.  that  one  or  more  of  the  eigenvalues  of  A 
has  large  negative  real  part,  and  the  other  eigen- 
values are  comparatively  small  in  absolute  value. 
Let  A,,Ap,.. 

that  they  are  ordered  so  that  A-,,...., 
"small"  eigenvalues  and  A. +, ,...., A  ar 

"stiff"  eigenvalues,  i.e.  those  having  large 

negative  real  part.  Given  any  initial  condition 

y(x  )  =  y  ,  the  initial  value  problem  has  a 
oo 

unique  solution.  If  A  is  simple  the  solution  is 

.,  -   ?  „(J)A 


.A  be  the  eigenvalues,  and  assume 

A,    are  the 
re  the 


I  vVJ'y\(x) 
j=l  J 


where 


■1    . 


y>)  = 


fcJe*p(xj(x-xo»-fyr if  xj  * 


Cj  +  b\(x-xQ] 


if  A.  =  0. 


v  J'  is  the  eigenvector  corresponding  to  A.,  the 

d.  are  constants  determined  by  b  and  A,  and  the  c- 

J  J 

are  constant  determined  by  the  initial  vector  y  . 

In  our  problem  we  assume  that  some  of  the  initial 
values  (components  of  y  )  are  known  and  others 
are  unknown.  We  wish  to  specify  these  unknown 
values  in  such  a  way  that  rapid,  short-term 
fluctuations  in  the  solution  are  minimized.  These 
short-term  fluctuations  are  caused  by  those  terms 
of  the  form  cexp(x.(x-x  ))  for  which  A.  has 

large  negative  real  part.  We  can  eliminate  these 
terms  by  specifying  the  initial  values  in  such  a 
way  that  c-  =  0,  j  =  k+1 .....  ,n .  A  solution 

having  this  property  is  said  to  be  in  stiff 
equilibrium.  Not  surprisingly,  in  order  to  obtain 
a  solution  which  is  in  stiff  equilibrium,  the 
number  of  initial  values  which  we  allow  to  vary 
must  be  equal  to  be  number  of  stiff  eigenvalues. 
This  restricts  the  class  of  problems  which  can  be 
solved,  but  the  restriction  is  not  as  severe  as  it 
might  at  first  seem  because  there  is  usually  some 
latitude  in  the  choice  of  stiff  eigenvalues..  For 
example,  if  A  has  eigenvalues  1,  0,  -1,-10  ,-10  , 
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-10^,  we  may  choose  to  designate  one,  two,  or  three 
eigenvalues  as  stiff. 

By  a  straightforward  procedure  we  can  deter- 
mine the  initial  values  which  give  a  solution  in 

stiff  equilibrium.  First,  yV(x  )  =  c.-y.b., 

_  1   J  o     J  J  J 

j=l,....,n,  where  u,  =  \~     if  A,  /  0  and  u.  =  0 

if  •  •  =  0.  These  equations  can  be  expressed  in 
matrix  form  as  y*(x  )  =  c  -  Mb\  where  M  is  the 

diagonal  matrix  whose  jth  main  diagonal  entry  is 

p..  Thus  y  =  V(c-mId),  where  V  is  the  matrix 

3  °        (i) 

whose  jth  column  is  v   .  This  matrix  equation 

can  be  written  in  block  form 


(1) 


where 


11 


'21 


V 


12 


'22 


d-r 


e-s 


z  consists  of  those  components  of  y 


which  are  unknown,  e  consists  of  those  components 
of  c  which  are  to  be  set  to  zero,  and  r  and  s  are 
components  of  the  known  vector  Mb.  For  any 
specified  w  we  wish  to  find 


z  such  that  this 
o 


system  has  a  unique  solution  c 


wi  th  e  =  0. 


Such  a 


z  must  satisfy  the  equations 


Vn(d-r) 
V2](d-r) 


V]2s 
V22s 


obtained  from  (1)  by  setting  e  =  0.  These  equa- 
tions have  a  unique  solution  if  and  only  if  V,, 


is  nonsingular.  If 


V,,  is  nonsingular  we  can 


solve  the  first  equation  uniquely  for  d-r.  We 
can  then  substitute  d-r  into  the  second  equation 
to  calculate  z  . 

The  procedure  which  we  have  just  outlined  in- 
cludes the  calculation  of  the  eigenvalues  and 
eigenvectors  of  A  and  the  solution  of  two  linear 
systems.  This  would  be  costly  for  a  large  system 
of  differential  equations,  but  for  a  system  of 
modest  size  (one  of  ten  equations,  say)  these 
operations  are  not  expensive. 

Now  consider  the  nonlinear  problem 
y'  =  f(y)>  y(xn)  =  y  >  where  f  is  assumed  twice 

continuously  differentiable.  We  are  concerned 
here  with  the  behavior  of  the  solution  in  the 
vicinity  of  y.  Near  y  the  solution  to  the  non- 
linear problem  is  well  approximated  by  the  solution 
to  the  linear  problem 

y"  =  f(yo}  +  fy   (yo)(y-yo)'  y(xo>  =  yo  • 

Ay+b,  where 

As  in  the 


This  problem  has  the  form  y' 
A  =  '¥-   (y  )  and  b  =  f(y  ] 


3f  /   v 

■z-   (y  )y 

;jy  uo';o 


linear  case,  y  contains  some  unknown  components, 
so  we  cannot  simply  proceed  as  in  the  linear  case. 
Instead  we  must  use  an  iterative  procedure.  We 
make  an  initial  guess  of  y  ,  use  this  initial 
guess  to  calculate  A  and  b,°and  proceed  as  in  the 
linear  case  to  calculate  a  new  guess  of  y  .  We 

repeat  this  procedure  until  (hopefully)  it  con- 


verges. 

In  the  nonlinear  case  the  partitioning  of 
the  eigenvalues  is  complicated  by  the  fact  that 
the  matrix  A  changes  from  iteration  to  iteration, 
and  the  eigenvalues  change  with  it.  Thus,  a 
partition  which  is  reasonable  on  one  iteration 
may  not  be  reasonable  on  the  next.  It  is  im- 
practical to  change  the  number  of  initial  values 
which  are  being  adjusted  from  one  iteration  to 
the  next,  so  we  must  decide  at  the  beginning  how 
many  we  wish  to  vary  and  stick  with  that  number. 
If  we  decide  to  let  i  initial  values  vary,  then 
at  each  iteration  we  place  the  i  "stiffest" 
eigenvalues  in  the  "stiff"  set  and  all  others  in 
the  "small"  set.  Once  the  iterations  have 
converged  to  some  y  ,  we  can  check  whether  the 
partition  is  reasonable  by  examining  the  eigen- 

3f 

values  of  —  (y J .  We  can  also  monitor  the 

ay         0 

partition  at  intermediate  iterations,  but  this  is 
not  necessary.  All  that  really  matters  is 
whether  the  final  partition  is  reasonable. 

NUMERICAL  RESULTS 

We  present  here  some  numerical  results  for 
the  system  of  transport  equations  discussed  in 
the  introduction.  Consider  the  following  initial 
conditions:  altitude  =  1500  km,  temperature  = 
2000  K,  heat  flow  =  -1.943  x  10-5  erg(cm)-2 

(sec)   .  The  unknown  initial  condition  is  stress. 
The  following  table  shows  the  stress  values 
calculated  by  the  method  proposed  here. 


Iteration 

0 
1 
2 
3 


Stress  (K) 

0.0 
-5.159232 
-5.163496 
-5.163496 


We  see  that  the  procedure  converges  to  seven 
decimal  places  in  two  iterations.  At  each 
iteration  the  Jacobian  matrix  was  estimated  by 
numerical  differencing.  The  eigenvalues  and 
eigenvectors  were  calculated  by  the  IMSL  sub- 
routine EIGRF,  which  uses  a  shifted  QR  algo- 
rithm. 

The  initial  values  have  been  partitioned 
into  three  known  values  (counting  altitude,  the 
independent  variable)  and  one  unknown  value.  In 
order  for  this  to  be  a  reasonable  partition  we 
must  be  able  to  partition  the  Jacobian  into  a 
set  of  three  "small"  eigenvalues  and  a  set  of 
one  "stiff"  eigenvalue.  The  following  table 
shows  the  eigenvalues  of  the  Jacobian  computed 
on  the  final  iteration. 


eigenvalues 

7.18  x  io"10 
0.0      g 
-4.47  x  10  * 

-7.80  x  io"6 


scale  heights 


2.24  x  io  cm  =  2240  km 
1.28  x  lo5cm  =  1.28  km 


If  we  designate  the  first  three  eigenvalues 
"small"  and  the  fourth  one  "stiff",  then  the 
"stiff"  eigenvalue  is  over  1000  times  larger  in 
absolute  value  than  any  of  the  "small"  ones. 
Thus  the  partition  is  reasonable. 

The  fourth  eigenvalue  shows  that  the  system 
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is  quite  stiff.   It  looks  small  but  it  is 
actually  large,  given  the  units  of  the  problem. 
It;  the  solution  the  term  which  corresponds  to 
eigenvalue  \   decays  by  a  factor  e  in  the  distance 

-■'    .  This  distance  is  commonly  called  the  scale 
f  the  term.  We  are  using  cgs  units,  so 
ale  height  is  in  centimeters.   If  we  express 
tnis  distance  in  kilometers,  then  the  scale 
height  of  the  term  corresponding  to  the  fourth 
eigenvalue  is  1.28  km.  This  is  an  extremely  short 

?nce,  given  that  we  are  integrating  out  some 
ten  thousand  kilometers.  By  contrast,  the  scale 

tit  associated  with  the  next  largest  eigenvalue 

er  2000  • 

Finally,  we  have  to  ask  whether  the  answer 
en  by  tnis  procedure  is  meaningful.   Is 
63496  really  the  "right"  initial  stress 
value?  Yes,  it  is.   If  we  integrate  outward 
using  this  initial  value  we  get  smooth  profiles 
wnich  do  not  show  any  short  term  fluctuations. 
The  use  of  any  other  initial  stress  value  leads 
to  Iarqe  initial  fluctuations  in  the  profiles 

are  unrealistic  from  a  physical  standpoint. 
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A  COMPARISON  OF  MKTHODS 
FOR  COMPUTING  STURM- 
LIOUVILLE  EIGENVALUES 

J.W.  PAINE 
Australian  National  University 


An  important  source  of  Initial  value  problems 
arises  from  the  use  of  shooting  methods  to  solve 
Sturm- Li ouvi lie  eigenvalue  problems  of  the  form 


x  e  (a,b) 


(l.a)   - (pu) '  +  qu  =  ' ru 

(l.b)      a   u(a)  +  Bu(a)  =  0 
y   u(b)  +  6u(b)  =  0 

(1.  c)     u,  pu  e  C[a,b] 


Because  of  the  large  variety  of  methods 
available  for  approximating  the  eigenvalues  of 
this  problem,  and  because  of  their  different 
underlying  assumptions  on  which  the  methods  are 
based,  it  is  important  to  know  and  to  understand 
the  different  properties  of  the  eigenvalue 
approximations  which  will  be  obtained. 

The  most  important  property  that  any  method 
should  possess  is  convergence,  and  (if  we  assume 
that  each  method  is  parameterized  over  the  set 
of  positive  integers)  in  it's  simplest  form  this 
is  expressed  as 


Um  X 


(N). 


(N) 


where  X   is  the  <'th  exact  eigenvalue  and  X 

<  "  K 

is  the  K'th  eigenvalue  of  the  N'th  approximat- 
ing problem.   But  although  such  basic  converg- 
ence results  are  necessary,  they  are  not 
altogether  useful  since  they  do  not  really  give 
much  information  about  the  approximations  that 
will  be  obtained  for  a  fixed  finite  value  of  N. 
As  a  result  of  this,  the  approximation  results 
for  most  methods  are  refined,  and  given  in  the 
form 


IX  -  X(N)|  <  C  N"P 


;  N  >  N  (k) 


where  C  generally  depends  on  <  but  is  independ- 
ent of  N. 

For  many  situations  such  estimates  are 
sufficient  to  guide  one's  choice  of  method. 
However  since  these  results  are  local  convergence 
results  in  the  sense  that  for  a  fixed  value  of 
k,  we  can  find  an  N  large  enough  such  that  the 
convergence  estimate  is  valid  for  every 

.  ,  there  are  situations  when  the  information 
given  by  such  local  convergence  results  will  be 


insufficient  or  even  misleading,  when  comparing 
different  methods.   One  such  situation  is  when 
there  is  a  large  number  of  eigenvalues  to  be 
approximated  and  there  is  a  definite  practical 
need  to  keep  the  size  of  N  as  small  as  possible. 

An  example  of  this  type  of  problem  arises 
in  the  use  of  spherically  symmetric  models  of 
the  earth  to  obtain  information  on  the  structure 
of  the  mantle  from  known  values  of  the  torsional 
free  oscillation  frequencies;   i.e.  the  inverse 
eigenvalue  problem.   One  approach  for  solving 
this  problem  is  to  choose  a  set  of  possible 
structures  and  by  finding  how  well  the  eigen- 
frequencies  of  each  of  these  models  match  the 
desired  values,  the  "best  possible"  model  (or 
models)  from  the  given  set  can  be  chosen.   The 
difficulty  is  that  the  number  of  eigenvalues 
over  the  whole  set  of  possible  structures  becomes 
quite  large  and  so  the  time  and  computing  resour- 
ces spent  in  finding  the  model's  eigenfrequencies 
needs  to  be  optimised  in  terms  of  the  information 
obtained. 

What  is  needed  to  improve  the  error  bounds 
is  to  give  them  a  more  global  character,  and 
one  possible  means  for  extending  the  bounds  to 
show  this  is  to  refine  the  bound  so  that  the 
dependence  of  C  on  k  becomes  explicit.   Thus  for 
the  purposes  of  comparison,  we  adopt 

(N) 


X' 


<  C  N  FX 


"Pi«l 


N  >  N  (K) 
ov  ' 


(where  C  is  now  independent  of  both  k  and  N)  as 
the  standard  error  bound. 

If  we  consider  the  bounds  obtained  in  this 
form  for  a  number  of  representative  methods 
(Rayleigh-Ritz,  Prufer  substitution,  modified 
Prufer  substitution,  finite  difference  and  a 
similar  operator  approach)  a  number  of  features 
of  the  methods  and  the  resulting  approximations 
become  apparent . 

The  first  point  is  that  since  the  rate  of 
growth  of  the  error  of  many  methods  is  directly 
related  to  some  high  order  derivative  of  the 
eigenfunction  (the  larger  the  order  of 
convergence  p,  the  higher  the  order  of  derivative) 
it  can  be  preferable  to  use  a  method  with  a 
lower  order  of  convergence  to  obtain  better 
approximations  of  the  larger  eigenvalues  at  the 
expense  of  losing  the  high  accuracy  approximations 


41-1 


of  the  smaller  eigenvalues. 

The  second  point  is  that  some  methods  (such 
as  modified  Prufer  substitution)  perform  better 
(i.e.  the  error  behaves  more  uniformly)  if  the 
original  eigenvalue  problem  is  restated  in  Liou- 
ville  normal  form 


(2. a) 
(2.b) 


Av 


t  €  (0,1) 


a*v(0)  +  6v(0)  =  0 
Y*v(l)  +  6v(l)  =  0 

(2.c)     v,  v  e  C[0,1] 

The  final  point  is  that  for  the  considered 
approximation  methods,  it  generally  occurs  that 
the  improved  uniformity  of  the  approximation 
(if  any  is  possible)  is  obtained  at  the  expense 
of  requiring  stronger  regularity  conditions  on 
the  problem  (e.g.  the  coefficients  p  and  r  should 
be  sufficiently  highly  differentiable  so  that 
the  problem  can  be  transformed  to  normal  form  (2)). 
Thus  it  is  desirable  to  find  which  methods  will 
give  the  best  results  for  a  given  set  of 
regularity  conditions.   Although  such  an  extens- 
ive analysis  has  not  been  undertaken,  particular 
emphasis  has  been  on  the  similar  operator 
approach  (based  on  transforming  (1)  to  a  quasi- 
normal  form  and  then  replacing  the  single 
coefficient  function  by  a  piece-wise  constant 
approximation,  the  resulting  problem  is  then 
analytically  solvable)  which  gives  uniform 
approximations  under  very  weak  regularity 
conditions. 
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THE  NUMERICAL  SOLUTION  OF  SECOND  ORDER  SYSTEMS  OF  ODES 


Organizer:   W.  H.  Enright 

Dept.  of  Computer  Science 
University  of  Toronto 

Panelists:   D.  G.  Bettis 

Dept.  of  Aerospace  Engineering 

and  Engineering  Mechanics 
University  of  Texas  at  Austin 

P.  Deuflhard 

Institut  Fur  Angewandte  Mathematik 

Universitat  Heidelberg 

F.  T.  Krogh 

Jet  Propulsion  Laboratory 

California  Institute  of  Technology 


The  numerical  solution  of  second  order  systems 
is  one  of  the  most  important  application  areas  for 
initial  value  methods.   Special  numerical  methods 
for  second  order  equations  have  been  in  use  for 
over  fifty  years  and  these  methods  continue  to  be 
improved  and  developed  in  parallel  with  methods 
for  first  order  systems.   Two  areas  where  these 
problems  frequently  arise  and  where  special 
methods  are  heavily  used  are  celestial  mechanics 
and  structural  dynamics.   Consequently  the  analy- 
sis and  development  of  new  methods  is  often 
tailored  to  classes  of  problems  from  these  areas. 
Because  of  the  gain  in  efficiency  that  is  possi- 
ble when  the  differential  equation  is  inde- 
pendent of  y'f  there  are  generally  two  classes 
of  methods  developed  for  second  order  equations. 
The  first  class  of  methods  are  suitable  for 
systems  of  the  form: 


y"=f(x,y).  y(x0)=y0,  y'(x0)=y'   ; 


(I) 


and  the  second  class  is  suitable  for  general 
systems: 


y"=f (x.v.y'),  y(xn)=y.,  y'(x.)=y" 


(ID 


The  members  of  this  panel  have  each  contri- 
buted to  the  improvement  and  development  of 
numerical  methods  for  second  order  problems. 
Professor  Bettis  has  recently  developed  (with 
K.  Horn)  efficient  embedded  Runge-Kutta-Nystrom 
methods  for  second  order  systems  (Bettis  and  Horn 
(1978)).   He  has  also  recently  been  involved  in 
the  development  of  exponentially  fitted  Runge- 
Kutta  formula  pairs.   Professor  Deuflhard  has  had 
considerable  experience  with  extrapolation  methods 
for  seccnd  order  equations  and  has  recently  de- 
veloped a  class  of  exponentially  fitted  extrapo- 
lation methods  (Deuflhard  (1978)).   Dr.  Krogh  is 
well  known  for  his  work  in  the  development  of 
efficient  multistep  methods  for  systems  of  ODES. 
The  multir::ep  methods  he  has  developed  can  handle 


second  order  equations  directly  and  he  has  in- 
vestigated the  advantages  of  doing  so  (Krogh 
(1975)).   This  discussion  is  to  be  an  informal 
discussion  of  the  state-of-the-art  in  this  area. 
Each  panelist  will  discuss  aspects  of  this  area 
that  he  has  been  Involved  in  or  finds  to  be  of 
particular  interest. 

It  is  of  course  impossible  to  anticipate  all 
the  issues  that  will  be  addressed,  but  we  will 
list  a  few  of  the  issues  and  questions  that  are 
currently  under  active  investigation  and  which 
are  likely  to  arise.   The  first  issue  is  the 
obvious  one  of  detailing  the  advantages  direct 
methods  have  over  the  alternative  approach  of 
reducing  the  system  to  a  larger  system  of  first 
order  equations.   Of  course  this  is  very  much 
related  to  the  general  topic  of  comparing  numeri- 
cal methods  (to  be  discussed  in  another  panel 
session).   The  difficulties  involved  in  comparing 
methods  are  particularly  relevant  here  since  we 
are  not  only  interested  in  comparing  different 
direct  methods,  but  also  in  quantifying  the  ad- 
vantage of  direct  methods. 

A  second  topic  which  has  recently  received 
attention  from  several  investigators  (Oear 
(1978),  Dahlquist  (1978)  and  Thomas  and  Gladwell 
(1978)),  is  the  question  of  stability  of  direct 
methods.   Are  there  classes  of  problems  where  a 
large  region  of  absolute  stability  is  important, 
and  if  so,  what  is  a  suitable  definition  of  sta- 
bility for  direct  methods?   This  issue  seems 
particularly  crucial  to  problems  arising  in 
structural  dynamics. 

Another  issue  that  is  of  considerable  interest 
is  the  question  of  what  an  appropriate  error 
control  strategy  should  be  for  second  order 
equations.   Although  this  question  is  reasonably 
well  understood  for  first  order  systems,  the 
situation  for  second  order  systems  is  not  as 
clear.   Is  it  necessary  to  monitor  and  control 
the  local  error  in  y'  as  well  as  y  for  problems 
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vpe  I  (or  II).   Some  theoretical  as  well  as 
computational  Investigations  would  be  helpful 
here. 

As  a  final  example  of  issues  that  will  likely 
be  discussed,  we  will  mention  exponential  fitting. 
This  idea  of  choosing  a  parameter  (or  parameters) 
of  the  formula  to  match  certain  known  character- 
istics of  the  solution  is  a  topic  that  has 
received  much  attention  in  the  literature  for 
both  first  and  second  order  equations.   It  is 
important  to  realize,  that  for  this  approach  to 
be  viable,  one  must  take  care  in  the  selection  and 
implementation  of  compatible  error  estimators  and 
stepsize  strategies.   These  components  of  a  method 
must  recognize  the  'built-in'  accuracy  of  the 
underlying  formula  and  also  account  for  the  fact 
that  the  'error  coefficient'  of  the  formula  will 
now  depend  on  the  stepsize.   It  is  also  important 
to  consider  whether  these  formulas  will  general- 
ize to  systems  of  equations,  and  whether  this  will 
require  an  approximation  to  the  matrix  expo- 
nential and  the  derivation  of  formulas  with  matrix 
coefficients. 
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PANEL  DISCUSSION  ON  NUMERICAL  METHOD  OF  LINES  SOFTWARE 
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1.  Introduction  (G.  D.  Byrne) 

The  numerical  method  of  lines  (NMOL)  involves 
the  following.  A  system  of  partial  differential 
equations  (PDE's)  is  discretized  in  its  space-like 
variables  to  obtain  a  system  of  ordinary  differ- 
ential equations  (ODE's)  in  the  time-like  variable, 
which  is  then  solved  by  an  ODE  package.  In  a 
numerical  method  of  lines  package,  the  spatial 
discretization  may  be  by  a  predetermined  finite 
difference  scheme,  a  user  specified  finite 
difference  scheme  of  a  certain  general  type,  a 
Galerkin  procedure  in  conjunction  with  B-splines, 
or  a  collocation  procedure  in  conjunction  with 
B-splines.  In  the  case  of  Galerkin  and 
collocation  procedures,  essential  boundary 
conditions  lead  to  algebraic  equations  or 
constraints,  which  are  interspersed  with  the  ODE's 
which  must  be  solved.  For  the  very  simple  heat 

equation,  u  =  u  ,  finite  difference-NMOL 
t    xx 

involves  the  solution  of  a  system  of  ODE's  of  the 

form  dy/dt  =  f(y,t),  while  collocation-B-spline 

or  Galerkin-B-spline  NMOL  involves 

A- (dy/dt)  =  g(y,t)  and  the  coefficient  matrix 

A  may  be  singular.  Systems  of  these  types  of 

ODE's  also  arise  from  less  simple  PDE's  for 

these  discretizations.  Of  course,  other 

discretizations  are  possible. 

The  panelists  are  associated  with  several 
general  purpose  NMOL  software  packages  which  use 
each  of  the  discretization  procedures  cited 
above.  Several  of  the  panelists  have  also 
developed  special  purpose  NMOL  software  for 
the  solution  of  specific  problems  in  heat 
conduction,  oil  reservoir  simulation,  and  combus- 
tion. Some  panelists  have  also  been  involved  in 
the  development  of  ODE  software  and  in  work  with 
various  nonlinear  and  linear  systems  solvers  for 
use  within  the  ODE  software  as  in  an  NMOL  package. 

In  the  abstracts  which  follow,  the  partici- 
pants are  concerned  with  the  requirements  imposed 
on  the  ODE  solver  by  the  types  of  ODE's  solved  in 


NMOL  packages,  desirable  features  of  the  ODE 
solver,  the  interface  between  the  ODE  solver  and 
the  rest  of  the  NMOL  code,  the  interface  between 
the  user  and  the  NMOL  code,  and  possible 
directions  of  research  in  packages  for  large, 
stiff  systems  of  ODE's,  which  arise  in  the  NMOL 
solution  of  multidimensional  systems  of  PDE's. 


2.  The  Ordinary  Differential  Equation  Algorithm 
as  Core  Component  of  a  Method  of  Lines  Partial 
Differential  Equation  Software  Package 
(M.  B.  Carver) 

The  properties  required  of  an  ODE  integrator 
within  a  method  of  lines  PDE  package  are  different 
from  those  required  of  an  integrator  offered  in  a 
general  subroutine  library  only  in  one  respect: 
that  the  user  is  one  more  step  removed  from  hands- 
on  control  of  the  integrator. 

Even  in  a  subroutine  library,  the  average 
user  lacks  sufficiently  specialized  knowledge  to 
guide  integration  by  parameter  selection.   In 
either  context,  therefore,  one  requires  sufficient 
robustness  to  automatically  negotiate  predictable 
difficulties  without  the  guidance  of  a  clairvoyant 
user,  but  enough  scope  for  the  informed  user  to 
provide  tuning  adjustments. 

A  number  of  efficient,  well -documented, 
error-controlled,  variable-step-size,  integration 
algorithms  are  available,  for  example  those 
discussed  in  references  [1  to  4]  and  it  is  a 
prerequisite  to  provide  such  a  routine  as  the  core 
of  any  integration  package.  However,  there  are 
several  specific  areas  where  traditionally 
required  user  interaction  can  be  either  eliminated 
or  reduced  to  optional  status  to  improve 
effectiveness.  They  are: 

(a)  Initial  Step  Size  Selection:  A  routine  with 
efficient  step  size  adjustment  is  far  better 
qualified  than  a  user  to  provide  this 
selection. 
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(b)  Tolerable  Error  Imposition:  An  algorithm 
normally  accepts  a  step  if  the  estimated 
local  truncation  error  in  Y.  satisfies 

T0L*Ybase.,  where  Ybase.  is  frequently 

defined  as  the  maximum  attained  value  of  Y. 

or  made  available  as  an  array  for  user  speci- 
fication. Again  users  have  no  criteria  on 
which  to  nominate  each  Ybase.,  and  the 

automatic  assignment  Ybase-  ■  max( | Y. | ,Ysig) 

J  J 

is  general,  and  requires  the  user  only  to 
provide  Ysig,  the  value  below  which  any  jY|  is 
effectively  zero,  if  this  differs  from  the 
machine  unit  round  off. 

(c)  Jacobian  Evaluation  and  Handling:  Most 
general  integrators  require  estimates  of  the 
Jacobian  matrix  from  time  to  time,  and  the 
user  is  often  left  to  speculate  on  a  choice 
of  how  these  should  be  obtained.  Sophisti- 
cates may  gain  efficiency  by  coding  their  own 
Jacobian,  but  within  a  method  of  lines 
package  this  possibility  grows  more  remote, 
and  could  end  up  less  efficient  than  numerical 
evaluation.  Standard  numerical  procedure  is 
to  perturb  each  individual  Y.  to  obtain  the 

set  8F./3Y.,  requiring  n  function  evaluation 

calls.  This  may  be  optimized;  for  example, 
a  tridiagonal  matrix  Jacobian  may  be  eval- 
uated using  only  three  calls,  and  propor- 
tionate savings  may  be  realized  in  sparse 
matrices  [5].  Sparse  matrix  Jacobian 
evaluation  and  handling  optimized  in  this  way 
is  invaluable  for  general  PDE/ODE  packages  in 
which  structure  is  arbitrary.  The  mechanics 
are  further  discussed  in  a  paper  to  this 
conference  [6]. 

(d)  Discontinuity  Detection  and  Negotiation:  A 
switch  in  equation  definition  occuring  at  an 
unknown  time  dependent  on  critical  relation- 
ships evolving  during  integration  is  a  common 
physical  phenomenon,  but  is  traumatic  for  an 
integrator  and  frequently  leads  to  time  limit 
or  termination.  A  robust  integrator  should 
detect  such  a  discontinuity  and  transcend  it 
issuing  a  suitable  terse  report.  Optional 
user  interaction  to  facilitate  smoother 
handling  of  discontinuities  should  be  made 
available,  for  example  see  reference  [7]. 

(e)  Calling  Sequence  and  Common  Blocks:   In  a 
package,  the  user  is  shielded  from  direct 
communication  through  the  integrator 
parameter  list,  but  this  is  mentioned  here 
to  facilitate  library  or  package  interface 
design.  Any  integrator  calling  sequence 
requires  ten  parameters,  the  derivative 
subroutine,  dependent  variable  and  derivative 
vectors  and  their  size,  independent  variable 
and  interval,  tolerance,  warning  flag,  one 
working  storage  array  and  current  step  size. 
Only  these  parameters  are  mandatory  and  all 
other  communication  may  be  through  optional 
common  blocks.  The  integrator  itself  can 
assign  parts  of  the  working  storage  array  in 
predictable  sequence,  and  choose  a  suitable 
method  for  handling  the  Jacobian  from  an 
assessment  of  the  equation  structure  and 


available  working  storage. 

The  GEARZ  integration  algorithm  used  in  the 
F0RSIM[8]  and  MAKSIM[9]  simulation  packages 
combines  the  above  consideration  with  the  work 
described  in  references  [2,  5  and  10]. 
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3.  Method-of -Lines  Considerations  in  ODE  Software 
Design  (A.  C.  Hindmarsh) 

Among  the  most  frequent  uses  of  ordinary 
differential  equation  (ODE)  software  are  those  for 
the  solution  of  partial  differential  equations 
(PDE's)  by  the  numerical  method  of  lines  (NMOL). 
Typical  of  such  problems  are  system^  of  parabolic 
PDE's  in  time  t  and  a  space  vector  x,  given 
initial  values  at  t  =  0  and  boundary  conditions  in 
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x.  The  resulting  ODE  problems  are  among  the  most 
challenging  because  of  their  potential  size, 
complexity,  and  stiffness.  PDE-based  problems  are 
frequently  (but  not  always)  stiff,  either  because 
of  physical  processes  present  (e.g.  chemical 
kinetics),  or  because  of  the  particualr  features 
of  the  spatial  discretization  (e.g.  a  simple  1-D 
diffusion  equation  finite-differenced  on  a  fine 
mesh).  The  combination  of  size  and  stiffness 
imposes  severe  restrictions  on  the  choice  of  ODE 
methods  and  on  their  implementation  —  restric- 
tions that  differ  greatly  from  those  for  nonstiff 
problems. 

Without  going  into  the  details  of  NMOL 
discretizations,  the  final  result  can  be  given  in 
one  of  two  forms.  For  finite  difference 
approaches,  one  generally  obtains  an  explicit  ODE 
system  y  =  f(y,t),  where  y  is  a  vector  of  length 
N,  and  an  initial  value  vector  y  is  known.  For 

weighted  residual  methods  (collocation,  Galerkin, 
fintie  element,  etc.),_  one  gets  instead  a  linearly 
implicit  ODE  system  Ay  =  g(y,t)  where  A  =  A(y,t) 
is  an  N  x  N  matrix  (usually  nonsingular)  wnose 
form  depends  on  both  the  discretization  method  and 
the  geometry  of  the  problem.   In  the  latter  case, 
cost  considerations  require  that  the  ODE  algorithm 
treat  the  system  in  a  direct  manner,  rather  than 
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invoke  an  algorithm  for  y  =  f  with  f 

When  the  ODE  is  stiff,  accuracy  and  numerical 
stability  dictate  the  use  of  implicit  methods. 
Nearly  all  implicit  methods,  when  implemented  so 
as  to  retain  the  desired  numerical  stability, 
solve  linear  algebraic  system  that  involve  the 
N  x  N  matrix  J  =  3f/3y,  or  both  A  and  J  =  3g/3y 
in  the  case  Ay  =  g.  While  the  literature  now 
contains  numerous  stiff  ODE  methods,  most  NMOL 
applications  use  methods  based  on  the  Backward 
Differentiation  Formulas  (BDF's).  The  reasons 
are  partly  historical,  and  partly  based  on  the 
following  facts  about  BDF  methods: 


a  modified  Newton  (chord)  iteration  on  the 
corrector,  but  handle  only  y  =  f  and  assume  J  to 
be  dense.  They  save  LU  factorizations  of  the 
required  matrix,  and  recompute  this  only  as 
needed.   In  NMOL,  J  and  A  invariably  have  much 
sparse  structure,  and  property  4  becomes  crucial. 
To  capitalize  on  it,  variant  versions  of  GEAR  and 
EPISODE  have  been  written,  dictated  by  structures 
that  commonly  occur  in  PDE-based  ODE  systems.  The 
modular  design  of  the  solvers  easily  allows  the 
dense  linear  system  treatment  to  be  replaced 
accordingly.  The  implicit  ODE  case  requires  some 
alterations  to  other  parts  of  the  algorithm  as 
well.  The  following  variants  exist  currently: 


1.  GEARB  treats  J  in  band  form.   It  is  used 
in  the  NMOL  packages  PDEPACK,  M0L1D, 
PDETWO,  DSS/2,  and  FORSIM. 

2.  EPISODEB  is  the  band  variant  of  EPISODE, 
analogous  to  GEARB. 

3.  GEARBI  treats  J  in  blocked  form  and  uses 
a  block-iterative  linear  system  algorithm 
(block-SOR).  It  is  used  in  various  1-D 
and  2-D  applications. 

4.  GEARS  assumes  a  general  sparse  form  for  J 
and  uses  the  Yale  Sparse  Matrix  Package. 

5.  GEARIB  treats  the  implicit  ODE  Ay  =  g 
with  A  and  J  =  3g/3y  in  band  form.  It 
is  used  in  the  NMOL  packages  PDECOL  and 
DISPL. 

6.  EPISODEIB  is  the  variant  of  EPISODE 
analogous  to  GEARIB. 

In  the  explicit  ODE  case,  these  solvers  retain  a 
nonstiff  (Adams)  option  for  convenience,  and  some 
of  the  NMOL  packages  relay  this  option  back  to 
the  user. 


1.  They  form  a  group  of  stiffly  stable 
methods  of  various  orders,  easily 
allowing  for  variability  of  step  size 
and  order. 

2.  They  require  the  solution  of  only  one 
N  x  N  nonlinear  algebraic  system  on 
each  time  step,  and  so  require  only 
one  copy  of  J  or  a  related  matrix 
(involving  A,  if  present  )  at  a  time. 

3.  They  do  not  need  J  (or  A,  if  present)  to 
be  very  accurate,  and  so  do  not  require 
a  reevaluation  of  J  at  every  step. 

4.  They  easily  allow  for  sparse  structure 
of  J  (and  A,  if  present)  to  be  utilized 
to  gain  both  speed  and  storage  economy. 

Few  other  methods  have  all  of  these  properties. 

There  are  two  ODE  solvers  which  have  seen 
particularly  heavy  use  in  the  context  of  NMOL: 
The  GEAR  package  uses  (for  stiff  problems)  fixed- 
step  BDF'r>  with  interpolator  step  changing.  The 
newer  EPISODE  package  resembles  GEAR  externally 
but  is  based  on  truly  variable-step  BDF's.  Both 

advantage  of  properties  1  to  3  above,  using 


Some  applications  have  involved  more  special 
variants.  For  example,  GEAR  has  been  combined 
with  a  tailored  preconditioned  conjugate  gradient 
algorithm  for  use  on  radiation  diffusion- 
scattering  problems.  - 

Alternatives  to  chord  iteration  exist  but 
have  not  been  tried  extensively  in  stiff  ODE 
software.  Experiments  with  nonlinear  block-SOR 
iteration  in  an  NMOL  context  have  shown 
considerable  promise.  Another  promising  approach, 
in  2-D  and  3-D,  is  to  combine  a  BDF  (or  similar) 
method  with  a  multiplicative  matrix  splitting 
of  the  type  used  in  ADI. 


4.  Some  Comments  on  the  Impact  of  PDE  Software 
on  ODE  Solvers  (G.  K.  Leaf  and  M.  Minkoff) 

A  common  feature  of  software  for  solving 
parabolic  differential  equations  is  the  use  of  the 
method  of  lines  for  the  reduction  of  the  partial 
differential  equation  to  a  system  of  ordinary 
differential  equations.  A  typical  PDE  package 
will  then  use  an  ODE  package  to  solve  the 
resulting  system  of  differential  equations.  The 
use  of  ODE  solvers  in  conjunction  with  PDE 
software  imposes  certain  requirements  on  the  ODE 


W-3 


solver.  First  we  note  that  the  method  of  lines 
usually  leads  to  systems  of  stiff  ordinary 
differential  equations;  hence  implicit  time  differ- 
encing is  generally  required.  Such  implicit  method 
use  Newton-type  nonlinear  equation  solvers  which 
require  estimates  for  the  Jacobian.  Particularly 
in  two  spatial  dimensions,  the  storage  of  this 
Jacobian  becomes  the  dominant  consideration  in  the 
implementation  of  PDE  software.  With  the  use  of 
local  approximations  for  spatial  effects,  the 
storage  problem  is  ameliorated  somewhat  by  the 
banded  structure  of  the  Jacobian.  When  a  Galerkin 
type  procedure  is  used,  the  resulting  system  of 
ODE's  will  be  implicit  in  time.  This  makes 
it  desirable  to  have  solvers  which  can  treat 
implicit  time  dependence  in  an  efficient  manner. 
When  the  method  of  lines  is  used,  the  boundary 
conditions  associated  with  the  PDE's  lead  to 
system  of  algebraic-differential  systems.  Finally, 
with  PDE's  a  substantial  cost  savings  can  be 
achieved  with  the  use  of  dump/restart  features. 
Such  features  are  almost  mandatory  in  ODE  solvers 
used  in  conjunction  with  PDE  software. 


5.  Two  Proposed  Standard  Interfaces  to  Facilitate 
the  Use  of  NMOL  Codes  (W.  E.  Schiesser) 

The  remarkable  flexibility  and  versatility  of 
the  numerical  method  of  lines  (NMOL)  will  most 
certainly  facilitate  the  integration  of  systems  of 
ordinary  and  partial  differential  equations  over 
a  spectrum  of  application  areas.  This  is 
particularly  true  for  scientists  and  engineers  who 
do  not  wish  to  become  experts  in  numerical 
analysis.   If  some  general  guidelines  are  developed 
for  interfacing  various  components  of  a  NMOL  code, 
the  user  will  be  able  to  take  better  advantage  of 
the  flexibility  of  the  method,  and  the  exchange 
of  applications  between  users  will  be  facilitated. 
In  this  presentation,  two  interfaces  will  be 
considered: 

(1)  The  interface  between  the  ODE  integrator 
and  the  routines  which  approximate  the 
spatial  (boundary-value)  derivatives  in 
partial  differential  equations  will  be 
considered.   In  particular,  the  freedom 
to  independently  select  the  methods  of 
integration  for  initial-value  and 
boundary-value  derivatives  should  be 
emphasized  to  users. 

(2)  The  interface  between  the  general -purpose 
NMOL  code  and  the  routines  which  define 
the  problem  should  be  standardized  as 
much  as  possible  to  facilitate  the 
exchange  of  application  problems.  The 
essential  elements  (e.g.,  routines)  to 
define  a  problem  will  be  considered, 

and  some  suggestions  given  for  a 
relatively  standardized  interface  between 
the  problem  statement  provided  by  the 
user  and  the  NMOL  code. 


6.  Comments  on  the  Numerical  Method  of  Lines 
(R.  F.  Sincovec) 


differential  equations  is  a  remarkable  success. 
Moreover,  this  approach  to  solving  partial 
differential  equations  has  allowed  the  development 
of  some  very  useful  and  modestly  reliable  general 
purpose  software  for  PDEs.   In  large  measure  these 
successes  are  due  to  the  availability  of  good 
software  for  solving  ordinary  differential 
equations. 

Unfortunately,  one  dimensional  models  are  not 
adequate  for  simulating  many  physical  phenomena 
and  quite  often  one  must  use  a  model  which  is 
described  or  characterized  by  multidimensional 
partial  differential  equations.  The  fundamental 
concept  of  the  numerical  method  of  lines  is 
essentially  dimension  independent,  so  in  principle 
the  technique  may  be  directly  applied  to  the 
higher  dimensional  problems.  However,  some  of  the 
practical  computational  aspects  of  the  approach 
have  prevented  its  widespread  use  of  the  higher 
dimensional  problems  which  require  the  use  of  stiff 
ODE  methods.  For  these  problems,  the  limitations 
occur  in  the  form  of  storage  requirements  and 
Jacobian  matrix  generation  and  solution  time.  The 
most  frequently  used  stiff  ODE  techniques  are 
those  of  Gear  and  the  Jacobian  matrix  is  required 
because  a  modified  Newton  method  is  used  to  solve 
the  implicit  nonlinear  corrector  equations.  When 
a  direct  method  is  used  to  solve  the  resulting 
linear  algebraic  systems,  the  computation  time 
per  equation  per  time  step  increases  drastically  as 
the  number  of  spatial  dimensions  increases.  It 
would  be  extremely  useful  if  reliable  and  efficient 
stiff  ODE  methods  could  be  developed  which  do  not 
require  so  much  storage  and  computational  cost  per 
time  step.  In  particular,  it  would  be  advantageous 
to  use  a  nonlinear  method,  which  requires  only  some 
portion  of  the  associated  Jacobian  matrix. 
Alternatively,  explicit  methods  with  larger 
stability  regions  than  those  now  in  common  use 
could  be  explored.   It  is  conceivable  that  such 
ODE  methods,  which  are  basically  different  from 
the  current  backward  differentiation  formulas, 
could  be  developed  -  at  least  for  some  classes 
of  problems. 

Another  approach  would  be  to  investigate  and 
develop  other  solution  techniques  for  the 
nonlinear  corrector  equations.  The  goal  would  be  to 
find   adequate  nonlinear  equation  solution  methods 
whose  cost  in  both  storage  and  execution  time  is 
less  affected  by  dimensionality  considerations. 
Nonlinear  SOR-Newton,  Newton-SOR  or  ADI  techniques 
could  be  potential  candidates.  The  use  of  any  of 
these  methods  would  probably  impose  restrictions 
on  the  type  of  problems  which  can  be  solved  and 
this  needs  to  be  investigated.  Also,  the 
relationships  between  the  convergence  criteria 
and  the  error  and  stepsize  control  in  the  ODE 
solver  would  have  to  be  explored. 

Many  problems  encountered  in  practice  can  be 
reduced  by  the  method  of  lines  to  a  coupled 
differential -algebraic  system.  More  attention 
needes  to  be  paid  to  such  systems.  Finally,  the 
ODE  software  itself  should  feature  dynamic 
dimensioning. 


The  use  of  the  numerical  method  of  lines  for 
the  solution  of  one  dimensional  partial 


W-M 


PANEL  ON  TESTING 


Moderator:   T.E.  Hull,  Department  of  Computer  Science,  University  of  Toronto,  Toronto,'  Canada 

Panelists:   M.B.  Carver,  Atomic  Energy  of  Canada  Limited,  Chalk  River,  Ontario,  Canada 

W.H.  Enright,  Department  of  Computer  Science,  University  of  Toronto,  Toronto,  Canada 
I.  Gladwell,  Department  of  Mathematics,  University  of  Manchester,  Manchester,  England 
F.T.  Krogh,  Jet  Propulsion  Laboratory,  Pasadena,  California,  U.S.A. 
L.F.  Shampine,  Sandia  Laboratories,  Albuquerque,  New  Mexico,  U.S.A. 


Each  panelist  was  asked  to  write  a  brief  summary  of 
his  point  of  view  and/or  any  comments  he  would  like 
to  make  on  this  topic,  and  the  following  is  what 
was  received: 

M.B.  Carver 

There  is  no  doubt  that  a  program  of  testing 
will  produce  the  most  useful  information  if  it 
investigates  the  widest  possible  spectrum  of  equa- 
tion set  type  and  size,  accuracy  level  requested, 
range  of  magnitude  of  variables,  etc.,  but,  given  a 
comprehensive  test  program  and  amassed  results,  the 
criteria  used  to  assess  or  rate  algorithms  will 
depend  heavily  on  the  type  of  use  for  which  an 
algorithm  is  to  be  selected.   While  a  considerable 
market  exists  for  extremely  precise  results,  proba- 
bly the  greater  use  of  integration  algorithms  is 
in  general  engineering  and  scientific  calculations 
in  which  three  to  five  reliable  digits  suffice. 

Such  algorithms  usually  reside  in  general  sub- 
routine libraries  or  are  embedded  in  user-oriented 
simulation  packages,  and  the  average  user  will  not 
be  equipped  to  select  parameters  to  guide  the 
integration  algorithm.   Thus  the  primary  judgement 
criterion  for  such  use  is  reliable  performance  in 
this  accuracy  range,  and  robust  or  self  sufficient 
behaviour  on  encountering  problems,  such  as  dis- 
continuities, which  are  common  in  practice  but 
traumatic  in  integration. 

W.H.  Enright 

During  the  past  several  years  a  group  of  us  at 
Toronto  have  been  involved  in  the  development, 
assessment  and  comparison  of  numerical  methods  for 
ODE's.   These  investigations  have  convinced  us  of 
the  need  for  a  collection  of  testing  programs  to 
aid  in  the  evaluation  of  numerical  methods,  and  the 
design  and  use  of  such  a  package  is  the  subject  of 
this  paper.   The  collection  of  programs  actually 
consists  of  two  very  similar  packages;  one  for  the 
assessment  of  non-stiff  methods  and  another  for  the 
assessment  of  stiff  methods.   Before  we  discuss  the 
test  programs  in  more  detail  we  must  describe  the 
type  of  applications  we  envisage  for  it  and  dis- 
tinguish two  points  of  view  that  will  be  taken  when 
interpreting  the  results. 

With  the  number  of  new  methods  being  developed 
•  publication,  it  Ls  .'i  formidable 
task  for  an  at        a  referee  to  evaluate  a  now 
method.   In  order  to  publish  an  algorithm  in  a 


technical  journal  the  author  is  expected  to  provide 
evidence  of  its  performance  characteristics  compared 
to  other  methods.   This  evidence  all  too  often  takes 
the  form  of  results  of  the  new  method  relative  to 
out-of-date  methods  on  a  few  carefully  selected 
problems.   Refereeing  such  submissions  is  difficult 
and  a  referee  is  often  reluctant  to  ask  for  more 
evidence  since  this  often  would  require  that  an 
author  obtain  and  implement  a  more  appropriate 
reference  method  and  run  several  new  test  problems. 
On  the  other  hand,  if  a  catalogue  of  standard  bench- 
mark results  were  available  for  existing  software, 
and  if  these  results  could  be  duplicated  and  results 
for  new  methods  readily  obtained,  then  we  could 
require  much  stronger  evidence  of  performance  for 
new  methods.   Another  use  of  the  testing  package 
would  be  that  of  a  potential  user  with  a  particular 
type  of  problem.   He  may  wish  to  determine  which 
method  is  most  suitable  for  him  by  benchmarking  the 
performance  of  available  methods  on  a  set  of  his 
own  model  problems. 

There  are  two  points  of  view  that  may  be  taken 
when  assessing  the  performance  of  a  numerical 
method.   One  may  be  primarily  interested  in  the 
evaluation  or  assessment  of  the  performance  of  an 
individual  method,  or  alternatively  he  may  be  in- 
terested in  comparing  the  relative  performances  of 
different  methods.   In  the  former  case  it  is  essen- 
tial that  the  performance  criteria  and  performance 
monitors  that  are  introduced  have  no  effect  on  the 
method  being  assessed.   That  is,  we  must  ensure  that 
the  method  is  not  modified  in  any  way,  as  such  modi- 
fications might  degrade  the  performance  of  the 
method.   In  order  to  satisfy  this  condition  the 
testing  package  must  be  flexible  and  able  to  assess 
a  particular  method  on  how  well  it  (the  method)  is 
accomplishing  what  it  was  designed  to  accomplish. 

On  the  other  hand,  when  one  is  comparing 
methods,  the  various  measures  of  reliability  and 
efficiency  can  only  be  compared  if  the  methods  being 
compared  are  accomplishing  the  same  task.   One  way 
to  ensure  that  this  is  the  case  is  to  modify  the 
methods  so  that  they  are  all  attempting  to  accom- 
plish the  same  task.   This  was  the  approach  adopted 
in  Hull  et  al  (1972),  Enright  et  al  (1975)  and 
Enright  et  al  (1976).   This  approach  allows  one  to 
compare  the  relative  performances  of  different 
techniques  or  approaches  but  it  must  be  acknowledged 
that  modified  methods  are  compared,  and  these  modi- 
fications might  have  a  greater  effect  on  some 
methods  than  others. 
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Lng  Initial  value  solvers 
ran  a  varietv  These  include 

r  visit  co  the  University  of  Toronto, 
stions  by  inexpi ■;       users  which 
I  fv        Le  to  inswer  without  extensive  testing, 
□  my  involvement  as  contributor  oi  the  ordi- 
L  equations  chapter  of  the  NAG 
iv.   I  have  tested  initial  value  solvers  in 
two  v        <ned  to  be  complementary  to  the  many 
ts  published  in  the  literature.   First  with 
es  I  have  performed  tests  where  the  ini- 
tial value  solvers  are  treated  as  a  black  box  and 
whert        :'t  to  simulate  their  use  by  an  inex- 

nced  user  for  low  accuracy  computation.   In 
these  tests  a  variety  of  stiff  and  non-stiff  sol- 
vers were  tested  and  a  selection  of  the  test  results 
are  available  in  an  internal  report.  Few  conclu- 
sions are  drawn  in  this  report  as  too  many  anoma- 
lies arise  in  this  type  of  testing.   In  a  second 
set  of  tests,  a  research  student  and  I  have  per- 
formed some  simple  profile  tests  (similar  to  those 
of  Lyness  and  Kaganove  for  quadrature)  on  a  small 
number  of  well-known  subroutines.   These  tests  do 
illustrate  the  effect  of  coding  features  of  the 
routines,  and  it  is  hoped,  in  further  tests,  to 
attempt  to  modify  the  coding  to  improve  the  profile 
test  results.   The  completed  tests  are  summarized 
in  a  report  which  is  in  preparation. 

F.T.  Krogh 

There  are  too  many  factors  to  be  considered 
in  the  testing  of  ODE  programs  for  me  to  attempt  to 
summarize  my  views  in  a  brief  paragraph.   The 
references:  "On  Testing  a  Subroutine  for  the  Numeri- 
cal Integration  of  Ordinary  Differential  Equations", 
J.  ACM  20  (Oct.  1973),  pp. 545-562,  and  "Opinions 
on  Matters  Connected  with  the  Evaluation  of  Pro- 
grams and  Methods  for  Integrating  Ordinary  Differ- 
ential Equations",  SIGNUM  Newsletter  7  (Oct. 
1972),  pp. 27-48,  do  a  good  job  of  representing 
my  views.   The  following  quote  from  the  second 
of  these  is  concerned  with  the  issue  I  consider 
most  important.   "It  is  hoped  that  those  who  are 
carrying  out  evaluations  for  a  wide  audience 
will  keep  in  mind  the  diversity  of  opinion  likely 
to  be  found  in  that  audience,  and  thus  will  pre- 
sent results  in  such  a  way  that  even  those  with  a 
quite  different  point  of  view  than  the  evaluator 
are  able  to  draw  conclusions  from  the  results." 

L.F.  Shampine 

I  am  most  concerned  with  two  aspects  of 
testing.   One  is  the  question  of  comparability. 
Often  codes  do  not  have  the  same  goals  and  even 
if  they  do,  the  way  the  task  must  be  specified  may 
be  incompatible.  Carelessness  about  the  nature 
and  purpose  of  tests  lead  to  incorrect  conclu- 
sions about  the  codes  tested.   The  other  aspect  is, 
what  are  the  tests  supposed  to  prove?  If  a  code 
performs  well  on  a  battery  of  problems,  so  what? 
We  need  theoretical  insight  to  interpret  and  to 
design  such  tests.   Analysis  of  models  would  also 
be  useful  for  drawing  better  substantiated  con- 
clusions . 
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A  USER  INTERFACE  STANDARD  FOR  ODE  SOLVERS 

Alan  C.  Hindmarsh 
Lawrence  Livermore  Laboratory 
Livermore,  CA  94550 


INTRODUCTION 

During  the  last  10  to  15  years,  a  number  of 
good  general  purpose  Fortran  programs  have  been 
developed  for  the  solution  of  the  initial  value 
problem  for  systems  of  ordinary  differential  equa- 
tions (ODE's).   However,  the  various  solvers  differ 
substantially  from  one  another,  and  the  various 
versions  of  each  individual  solver  often  differ 
substantially  from  site  to  site.   This  lack  of 
standardization  causes  inconvenience  and  confusion 
for  users  and  maintenance  headaches  for  the 
authors  of  the  solvers. 

At  the  same  time,  efforts  undertaken  in  other 
areas  of  numerical  analysis  have  succeeded  in 
producing  "systematized"  collections  (EISPACK, 
FUNPACK,  LINPACK,  MINPACK,  etc.)  of  software  of 
high  quality  and  meeting  certain  uniformity  stand- 
ards.  The  idea  of  producing  a  similar  systematized 
collection  of  ODE  solvers  (initial  value  problem), 
tentatively  called  ODEPACK,  has  been  discussed 
since  1973  by  various  groups  of  people,  from  all 
over  the  world.   Since  1976,  efforts  related  to 
the  ODEPACK  concept  have  primarily  involved  only 
people  at  Dept.  of  Energy  laboratories,  and  have 
been  funded  in  part  by  the  Applied  Mathematical 
Sciences  Research  Program  under  the  Office  of 
Basic  Energy  Sciences  in  DOE. 

A  necessary  preliminary  to  any  actual 
development  of  an  ODEPACK  is  the  setting  of 
standards  for  the  interface  between  the  user  and 
the  ODE  solvers.   Thus  there  has  been  considerable 
discussion  of  a  detailed  User  Interface  Standard 
(UIS)  for  ODEPACK.   The  user  interface  for  a  given 
solver  consists  largely  of  the  call  sequence  of 
the  subroutine  which  the  user  must  call,  together 
with  the  call  sequences  of  the  one  or  more  user- 
supplied  routines  which  the  solver  calls.   The 
appropriate  content  of  the  user  interface  for  an 
ODE  solver  varies  greatly  with  the  nature  of  the 
user  and  the  nature  of  the  solver.   The  goal  of  a 
UIS  is  to  Identify  the  features  that  are  appropri- 
ate to  all  (or  almost  all)  such  contexts  and  put 
those  features  in  a  concrete  form  that  is  as 
simple  as  possible  and  yet  meets  the  variety  of 
demands  envisioned.   The  UIS  summarized  below  has 
evolved  from  numerous  discussions  and  meetings, 
involving  about  20  people. 

The  need  to  allow  for  a  wide  variety  of 
solvers  and  user  demands  results  in  a  UIS  which 
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are  to  be  standard  across  the  ODEPACK  collection 
and  required  of  all  users.   It  includes  also  items 
of  communication  that  are  solver-dependent  and/or 
optional.   Solver-dependent  items  are  ones  that 
will  appear  in  the  interface  of  some  solvers,  but 
not  others.   Optional  items  in  a  given  solver 
interface  are  ones  that  the  user  may  supply  (if 
input)  or  access  (if  output)  but  is  not  required 
to.   In  the  case  of  optional  inputs,  default 
values  are  always  available.   Solver-dependent 
items  may  or  may  not  be  optional,  and  optional 
items  may  or  may  not  be  solver-dependent.   In  all 
cases,  these  items  are  present  in  the  UIS  to  as- 
sure that  their  names  and  meanings  will  be 
standard  where  and  when  they  occur.   The  UIS  is 
complicated  further  by  the  fact  that  communication 
need  not  be  strictly  through  the  call  sequence: 
Optional  inputs  and  outputs  may  be  communicated  by 
calls  to  other  routines,  by  Common,  or  in  some 
cases  by  allowing  the  user  to  supply  his  own 
version  of  one  of  the  subroutines  in  the  solver. 

What  follows  is  a  summary  of  the  present  UIS. 
Complete  details  can  be  found  in  Ref.  1. 


SUMMARY  OF  THE  USER  INTERFACE  STANDARD 

The  primary  means  of  communicating  with  the 
solver  will  be  the  call  sequence  of  the  subroutine 
associated  with  the  solver.   The  parameters  of 
this  call  sequence  are  summarized  below. 

F        =  name  of  user-supplied  subroutine 

defining  the  ODE  system,  y  =  f(t,y). 


NEQ 
Y 


TOUT 


size  of  first-order  ODE  system. 

array  of  dependent  variables.   Used  to 
input  initial  values  and  output 
answers. 

independent  variable.   Used  to  input 
initial  point  and  output  the  point  at 
which  Y  is  evaluated. 

next  value  of  t  at  which  answers  are 
desired. 


ITOL,  RTOL,  ATOL  =  input  parameters  for  error 
control.   Roughly,  estimated  local 
errors  in  y^  will  be  kept  less  than 
RTOL(i) |yi|+AT0L(i)  in  some  norm. 
RTOL  and  ATOL  can  be  of  either  scalar 
or  vector  type,  and  ITOL  has  one  of  4 
values  to  identify  the  type  of  each. 
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•■  input  indicator  for  task  desired: 

1:   normal  calculation  of  y(t)  at 

t  =  TOUT  (possibly  overshoot  and 
interpolate) . 

2i   take  only  one  step  and  return. 

3:   hit  TOUT  exactly. 

A:   stop  at  the  first  internal  mesh 
point  t  at  or  beyond  TOUT. 

5:   take  only  one  step,  without  pas- 
sing TOUT. 

ISTATE    ■  state  indicator  (input  and  output). 

Input : 

1:   first  call  (solver  will 
initialize  itself). 

2:   continue  normally,  with  no 
change  in  inputs  except  TOUT 
and  possibly  ITASK. 

3:   continue  normally,  but  with 
changes  in  other  inputs. 


Output: 
1: 


2 
-1 
-2 
-3 


nothing  done  (TOUT  =  T  on 
first  call). 

task  was  accomplished. 

excess  work  on  this  call. 

too  much  accuracy  requested. 

illegal  input. 

(Other  negative  values  are 
solver-dependent . ) 


JAC      ■  name  of  user-supplied  subroutine  to 
compute  the  Jacobian,  J  =  3f/3y. 

G        =  name  of  user-supplied  function  sub- 
program for  the  function  g  in  the 
rootfinding  problem  g(t,  y(t))  =  0. 

Solver-dependent  optional  inputs  and  outputs 
may  be  communicated  as  parts  of  work  arrays,  by 
Common,  or  by  optional  calls  to  special  routines. 
Optional  inputs  for  which  standard  names  and 
meanings  exist  include  the  maximum  number  of  steps 
per  call  to  the  solver,  the  maximum  number  of  f 
evaluations  per  call,  the  maximum  order,  the 
initial  step  size,  and  upper  and  lower  bounds  on 
step  size.   Optional  outputs  for  which  standard 
names  and  meanings  exist  are  the  current  internal 
mesh  point  in  t;  cumulative  counts  of  steps  and 
evaluations  of  f,  J,  and  g;  the  step  size  and 
order  last  used;  the  index  of  the  largest  weighted 
local  error;  and  a  tolerance  scale  factor  to  be 
used  to  continue  after  a  request  for  too  much 
accuracy  was  flagged. 

There  are  two  optional  calls  by  the  user  to 
other  routines  that  are  defined  in  the  standard. 
One  is  to  reset  the  logical  unit  number  for  the 
printing  of  messages  (if  different  from  the 
default).  The  other  is  to  suppress  all  printing 
of  messages  (the  default  being  to  print  messages 
where  appropriate). 


Reference 

[1]   A.  C.  Hindmarsh,  A  Tentative  User  Interface 
Standard  for  ODEPACK,  LLL  Report  UCID-17954, 
October  1978. 


A  printed  message  will  contain  details 
of  all  returns  with  ISTATE  <  0. 

IOPT      =   input  flag  to  signal  use  of  optional 
inputs. 

0:   no  optional  inputs  (defaults  to 
be  used). 

1:   all  optional  inputs  will  be 
examined. 

RWORK,  IWORK  =  real  (single  or  double  precision) 
and  integer  work  space  arrays.  Length 
formulas  will  be  given.   The  first  few 
words  of  each  may  be  used  for  optional 
commun  ica  t  ion . 

LRW,  LIW  =  user-declared  lengths  of  RWORK,  IWORK. 
(These  will  be  checked.) 

In  addition  to  the  above  parameters,  which 
will  be  required  of  all  solvers,  there  are  solver- 
dependent  parameters  related  to  Jacobian  matrices, 
rootfinding,  method  switches,  etc.   The  following 
standard  names  will  be  used,  where  relevant: 
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AN  INCOMPLETE  LIST  OF  BETTER  FORTRAN  CODES  FOR  IVPS  IN  ODES 


Robert  D.  Skeel 

Department  of  Computer  Science 

University  of  Illinois  at  Urbana-Champaign 

Urbana,  Illinois  61801 


This  listing  is  restricted  to  those  computer 
programs  which  are  available  on  magnetic  tape  or 
punched  cards.   First  is  a  listing  of  codes  grouped 
into  collections  and  then  a  listing  of  software 
distributors.   The  author,  who  alone  is  responsible 
for  the  accuracy  of  this  material,  wishes  to 
acknowledge  the  direct  receipt  of  much  useful 
information  from  the  following  people:   M.  Butler, 
G.  D.  Byrne,  M.  B.  Carver,  P.  A.  Fox,  I.  Gladwell, 
A.  C.  Hindmarsh,  D.  Kainer,  C.  B.  Moler,  D.  Sayers , 
and  L.  F.  Shampine. 


PART  I 


COLLECTIONS  OF  CODES 


1.  International  Mathematical  and  Statistical 
Libraries,  Inc. — Chapter  D:   Differential 
Equations;  Quadrature;  Differentiation, 
distributor:   IMSL,  Inc. 

a.  DGEAR 

documentation:   A.  C.  Hindmarsh,  "GEAR: 
Ordinary  differential  equation  solver," 
UCID-30001,  Rev.  3,  Lawrence  Livermore 
Lab,  U.  of  Calif ornia/Livermore,  December 
1974. 
description:   modification  of  the  Hindmarsh 
package  GEAR  based  on  Adams-Moulton  and 
backward  differentiation  formulas  of 
various  orders. 


Z.  b.  D02BAF/D02BBF/D02BDF/D02BGF/D02BHF/D02PAF, 

D02XAF/D02PAF,  D02XBF 

description:   based  on  Merson's  5-stage 
Runge-Kutta  formula  of  order  4;  various 
drivers  perform  output  at  user  selected 
points,  global  error  estimation,  stiffness 
checks,  root-finding. 

c.  D02CAF/D02CBF/D02CGF/D02CHF/D02QAF,D02XGF/ 
D02QAF,D02XHF 

description:   essentially  GEAR,  Rev.  3  of 
Hindmarsh  with  Adams-Moulton  formulas 
selected;  drivers  for  various  tasks. 

d .  D02EAF/D02EBF/D02EGF/D02EHF/D02QBF , D02XGF/ 
D02QBF,D02XHF 

description:   essentially  GEAR,  Rev.  3  of 
Hindmarsh  with  backward  differentiation 
formulas  selected;  drivers  for  various 
tasks. 

e.  D02YAF 

description:   step-oriented  integrator  based 
on  Merson's  5-stage  Runge-Kutta  formula 
of  order  4. 

ttf.  D02AAF  (superseded  by  D02YAF) 

ttg.  D02ABF  (superseded  by  D02BAF  and  D02PAF) 

tth.  D02AHF  (superseded  by  D02CAF  and  D02QAF) 

tti.  D02AFJ  (superseded  by  D02EAF  and  D02QBF) 


b.  DREBS 

description:   modification  of  the  Bulirsch- 
Stoer  Algol  procedure  DESUB  based  on 
rational  extrapolation  of  Gragg's  method. 

c.  DVERK 

documentation:   T.  E.  Hull,  W.  H.  Enright, 
and  K.  R.  Jackson,  "User's  guide  for 
DVERK — A  subroutine  for  solving  non-stiff 
ODE's,"  T.R.  No.  100,  D.  of  C.S.,  U.  of 
Toronto,  Oct.  1976. 

description:   based  on  8-stage  Runge-Kutta 
formula  of  orders  5  and  6  developed  by 
I.  H.  Verner. 

2.  NAG  FORTRAN  Library,  Mark  7— Chapter  D02 : 
Ordinary  Differential  Equations. 
distributor:   Numerical  Algorithms  Group. 


3.  Bell  Laboratories  Mathematical  Subroutine 

Library,  PORT — differential  equations  chapter 
distributor:   Bell  Laboratories. 

a.  ODES/DODES 

documentation:   N.  L.  Schryer,  "A  user's 
guide  to  DODES,  a  double  precision 
ordinary  differential  equation  solver," 
Computing  Sci.  T.R.  It 33,    Bell  Laborator- 
ies, Murray  Hill,  N.J.,  1975. 
description:   based  on  extrapolation  of 
Gragg's  method. 

note:   uses  the  function  subprogram  I1MACH, 
the  storage  allocator  and  the  error 
handling  facility  of  the  PORT  library. 


D02AFF 

description:   least-squares  fitting  of  a 

I  '.till,,         /nomials  to  solve 
a  system  '.I  linearized  OD 


t  This  routine  will  be  withdrawn  at  Mark  8. 
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I  Algorithms   from  ACM — category   D2 : 
in.irv  Differentia]    Equations 
distributoi  Algorithms  Distribution 

.     Inc. 

ERK 
other  distributor:   NESC  (part  o(   MATHLIB, 
^54) 
•ient.it  ion :   L.  F.  Shampine  and  H.  A. 
Watt  .      irlthm  504  CERK:   Global  Error 
Est  in.it  ion  for  Ordinary  Differential 
Equations  [D]," 

June  1978),  200-203. 
description:   the  6-stage  Runge-Kutta-Fehlberg 
formula  of  order  4  and  5  with  global  error 
estimation  by  Richardson  extrapolation. 

b.  STINT 

documentation:   J.  M.  Tendler,  T.  A.  Bickart, 
and  Z.  Picel,  "Algorithm  534  STINT: 
STiff  (differential  equations)  INTegrator 
(D2].       ■'ans.   on  Math.   Software  4,   4 
(Dec.  1978),  399-403. 
Z.  Picel,  T.  A.  Bickart,  and  J.  M. 
Tendler,  "STINT,  STiff  ordinary  differ- 
ential equations  INTegrator,  Program  and 
user  manual,"  TR-76-12,  D.  of  Elect.  & 
Comp.  Eng. ,  Syracuse  U. ,  Syracuse,  N.Y. , 
Dec.  1976. 

description:   cyclic  composite  multistep 

methods  of  orders  1  through  7,  particu- 
larly well  suited  for  systems  wherein 
the  Jacobian  matrix  has  complex  eigen- 
values near  the  imaginary  axis. 

5.  DEPAC 

description:   systematized  collection  of  ODE 
solvers  being  developed  at  Sandia  Laboratories 

a.  RKF45 

distributors:   Prof.  Cleve  Moler  (for  the 
complete  set  of  subroutines  in  F,  M&M) 
NESC  (part  of  MATHLIB,  No.  654) 

documentation:   G.  E.  Forsythe,  M.  A.  Malcolm, 
and  C.  B.  Moler,  Computer  Methods  for 
Mathematical  Computations,   Prentice-Hall, 
Englewood  Cliffs',  N.J.,  1977. 
L.  F.  Shampine  and  H.  A.  Watts, 
"Practical  solution  of  ordinary  differ- 
ential equations  by  Runge-Kutta 
methods,"  SAND  76-0585,  Sandia  Labs., 
Albuquerque,  N.M. ,  1976. 

description:   based  on  the  6-stage  Runge- 
Kutta-Fehlberg  formula  of  order  4  and  5. 

b.  CERK 
see  4a. 

c.  ODE/DE/STEP.INTRP 

distributor:   NESC  (No.  640;  also  part  of 

MATHLIB,  No.  654) 
documentation:   L.  F.  Shampine  and  M.  K. 

Gordon,  Computer  Solution  of  Ordinary 

Differential  Equations,   Freeman,  San 

Francisco,  Calif. ,  1975. 
description:   variable-order  variable-step 

Adams  PECE  code  with  local  extrapolation. 

d.  ODERT/DERT/STEP.INTRP 
distributor:   NESC  (No.  640) 
description:   root-finder. 


5.  e.  RKSW 

disi  ribul  or:   Dr.  L.  K.  Shan; 
documentation:   L.  F.  Shampine  and  J.  A. 

Wisniewski,  "The  variable  order  Run, 
Kutta  code  kksw  ami  its  performance," 
SAND78-I       ,,<l  ia  Labs.  ,  Albuquei  que, 
N.M. ,  July  1978. 
description:   based  on  a  5-8 t age  RK  formula 
of  orders  3  and  4  and  a  13-stage  formula 
developed  by  J.  H.  Verner  of  orders  7 
and  8. 

6.  LLL  family  of  ordinary  differential  equation 
solvers 

a.  GEAR 

distributor:   NESC  (No.  592) 
documentation:   A.  C.  Hindmarsh,  "GEAR: 
Ordinary  differential  equation  system 
solver,"  UCID-30001,  Rev.  3,  Lawrence 
Livermore  Lab.,  U.  of  Calif . /Livermore , 
Dec.  1974. 
description:   revision  of  the  Gear  subroutine 
DIFSUB  based  on  Adams-Moulton  and  back- 
ward differentiation  formulas  of  various 
orders . 

b.  GEARB 

distributor:   NESC  (No.  661) 
documentation:  A.  C.  Hindmarsh,  "GEARB: 
Solution  of  ordinary  differential 
equations  having  banded  Jacobian," 
UCID-30059,  Rev.  1  with  corrections, 
L.L.L.,  July  1976. 

c.  GEARS 

distributor:   Dr.  A.  C.  Hindmarsh 
documentation:   the  source  is  self-document- 
ing as  to  usage, 
description:   uses  Yale  Sparse  Matrix 
Package;  intended  for  ODEs  having  a 
sparse  Jacobian  matrix. 

d.  GEARIB 

distributor:   Dr.  A.  C.  Hindmarsh 
documentation:   A.  C.  Hindmarsh,  "Prelimin- 
ary documentation  of  GEARIB:   Solution 
of  implicit  systems  of  ordinary  differ- 
ential equations  with  banded  Jacobian," 
UCID-30130,  L.L.L.,  Feb.  1976. 

e.  GEARB I 

distributor:   Dr.  A.  C.  Hindmarsh 
documentation:   A.  C.  Hindmarsh,  "Prelimin- 
ary documentation  of  GEARBI:   Solution 
of  ODE  systems  with  block-iterative 
treatment  of  the  Jacobian,"  UCID-30149, 
L.L.L.,  Dec.  1976. 
description:   GEAR  variant  using  block-SOR 
solution  of  linear  systems. 
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6.  f.  EPISODE 

distributor:   NESC  (No.  675) 

documentation:   A.  C.  Hindmarsh  and  G.  D. 
Bvrne,  "Applications  of  EPISODE:   an 
Experimental  Package  for  the  Integration 
of  Systems  of  Ordinary  Differential 
Equations,"  in  Numerical  Methods  for 
Differential  Systems,   L.  Lapidus  and 
W.  E.  Schiesser,  eds.,  Academic  Press, 
1976,  147-166. 

A.  C.  Hindmarsh  and  G.  D.  Byrne, 
"EPISODE:   an  Effective  Package  for  the 
Integration  of  Systems  of  Ordinary  Dif- 
ferential Equations,"  UCID-30112,  Rev.  1, 
L.L.L.,  April  1977. 

description:   variable-order  code  based  on 
variable-step   Adams-Moulton  and  backward 
differentiation  formulas. 


PART  II  -  DISTRIBUTORS  OF  CODES 

IMSL,  Inc. 

Sixth  Floor,  GNB  Building 
7500  Belaire  Boulevard 
Houston,  Texas  77036 

The  Secretary,  NAG  USA  Inc. 

1250  Grace  Court 

Downers  Grove,  Illinois  60515 

312/971-2337 

outside  North  America: 

The  NAG  Library  Service  Coordinator 
Numerical  Algorithms  Group  Ltd. 
7  Bandbury  Road 
Oxford  0X2  6NN  United  Kingdom 
Oxford  (0865)  511245 


g.  EPISODEB 

distributor:   NESC  (No.  705) 
documentation:   G.  D.  Byrne  and  A.  C. 

Hindmarsh,  "EPISODEB:   an  Experimental 
Package  for  the  Integration  of  Systems 
of  Ordinary  Differential  Equations  with 
Banded  Jacobians,"  UCID-30132,  L.L.L., 
April  1976. 

h.  EPISODEIB 

distributor:   Dr.  A.  C.  Hindmarsh 
documentation:   G.  D.  Byrne,  H.  G.  Brown, 
and  A.  C.  Hindmarsh,  "EPISODEIB:   an 
Effective  Package  for  the  Integration  of 
Systems  of  Ordinary  Differential  Equations 
of  Implicit  Banded  form,"  in  Extended 
Abstracts,  Texas  Conf.  on  Math.  Software, 
Univ.  of  Texas  at  Austin,  March  1978. 

7.  STFODE/COLODE 
distributor:   NESC  (No.  652). 
developers:   B.  L.  Hulme  and  S.  L.  Daniel, 
documentation:   SAND74-0380,  SAND74-0381, 

Dec.  1974. 
description:   collocation  methods  for  solving 
stiff  ODEs. 

8.  STIFFZ/GEARZ 

distributor:   Dr.  M.  B.  Carver 

developers:   M.  B.  Carver  and  D.  G.  Stewart 

documentation:   AECL  FTN  LIBRARY  description 

sheets  for  STIFFZ  numbered  1-17-20  and 

dated  Sept.  1978. 
description:   modifications  of  the  GEAR  routines 

of  A.  C.  Hindmarsh,  L.L.L.,  and  the  sparse 

matrix  routines  of  A.  C.  Curtis  and  J.  K. 

Reid,  Harwell . 

9.  RK58 

distributor:   Prof.  F.  E.  Cellier 
documentation:   complete  documentation  as 

header  in  the  subroutine. 
description:   based  on  the  6-stage  Runge-Kutta- 
Fehlberg  formula  of  orders  4  and  5,  as  well 
as  on  the  13-atage  RKF  formula  of  orders 
7  and  8;  user  is  allowed  to  specify  sequences 
of  termination  criteria  as  crossings 
through  zero. 
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institute  interested  in  obtaining  a  royalty-free 
license  for  the  PORT  Library.   Inquiries  for  a 
commercial  license  should  be  sent  to 

Western  Electric  Co. 

Patent  Licensing  Manager 

P.  0.  Box  20046 

Greensboro,  North  Carolina  27420 

4.  National  Energy  Software  Center  (NESC) 
Argonne  National  Laboratory 

9700  South  Cass  Avenue 
Argonne,  Illinois  60439 

how  to  obtain  code:   contact  your  institutional 
NESC  representative  or  computer  center 

5.  Professor  Cleve  Moler 
Department  of  Mathematics 
University  of  New  Mexico 
Albuquerque,  New  Mexico  87131 

6.  Dr.  L.  F.  Shampine 

Numerical  Mathematics  Division  5642 
Sandia  Laboratories 
Albuquerque,  New  Mexico  87185 

7.  Dr.  A.  C.  Hindmarsh 

Mathematics  and  Statistics  Section,  L-300 

Lawrence  Livermore  Laboratory 

P.  0.  Box  808 

Livermore,  California  94550 

8.  Dr.  M.  B.  Carver 

Mathematics  and  Computation  Branch 
Chalk  River  Nuclear  Laboratories 
Chalk  River,  Ontario 
Canada  K0J  1J0 

9.  Professor  F.  E.  Cellier 
Institute  for  Automatic  Control 

The  Swiss  Federal  Institute  of  Technology 

ETH-Zentrum 

CH-8092  Zurich  Switzerland 
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